Machine Learning - level 1 module 3

Technical Areas:

  • load csv data using pandas et visualize the data

  • data exploration and observe that the data is unbalance

  • text preprocessing

  • Simple classification model using Naive Bayes, Decision Tree, Logistic Regression and SVM

Tools:

  • Visual Studio

  • Pandas

  • sklearn

  • nltk

Soft skills:

  • Work with my team to discuss and debug together

  • check youtube channel for, data cleaning, ML and NLP course

Highlights:

  • load csv file , perform data exploration and data cleaning

  • Modeling the data with some basic classification algorithms

  • compute recall precision and f1 score for each algorithms

Next step

  • Need to gather more data

  • deal with unbalanced data

  • build the basic recommender system