Mutaz_ahmed - Machine Learning Pathway

Weeks 1 + 2
Concise Overview of Things Learned:

Technical Area:

  • Count Vectorizer
  • TF-IDF
  • Introduction to LSTM and RNN Models

Tools:

  • sklearn API
  • PyTorch

Soft Skills:

  • Presentation over Digital Medium

Three Achievement Highlights:

  1. Used CountVectorizer and Logistic Regression on the Amazon dataset (sketched below).
  2. Built a TF-IDF & Logistic Regression model for the Amazon dataset, which yielded an accuracy of 74%.
  3. Presented the CountVectorizer model to my team and presented Team Amazon's findings across all the models we ran.
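
Below is a minimal sketch of the CountVectorizer + Logistic Regression setup from highlight 1; the file name and the "review_text"/"label" column names are placeholders assumed for illustration, not the actual Amazon dataset schema.

```python
# Minimal sketch: bag-of-words counts + logistic regression (placeholder path and columns).
import pandas as pd
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

df = pd.read_csv("amazon_reviews.csv")  # placeholder path, not the real file
X_train, X_test, y_train, y_test = train_test_split(
    df["review_text"], df["label"], test_size=0.2, random_state=42
)

vectorizer = CountVectorizer(stop_words="english")  # raw term counts per document
X_train_vec = vectorizer.fit_transform(X_train)
X_test_vec = vectorizer.transform(X_test)

clf = LogisticRegression(max_iter=1000)
clf.fit(X_train_vec, y_train)
print("accuracy:", accuracy_score(y_test, clf.predict(X_test_vec)))
```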

Meetings Attended:

  • Updates on Webscraping 6/8
  • Data Cleaning + Last-Minute Webscraping Problems 6/9
  • Review and Refine our Data for ML 6/12
  • Embedding Techniques + ML Algorithms 6/16
  • Embedding Techniques + ML Algorithms 6/18
  • Week 5 Planning + Team Building 6/19
  • Presentation of Week 3 Teams' Work 6/22
  • Discuss Data Merging and BERT vs. TF-IDF 6/24

Tasks Completed:

  • I learned about the CountVectorizer bag-of-words representation and built some initial models to see how it works.
  • I then created a presentation and walked my team through what I learned, most notably that CountVectorizer is essentially a simpler version of TF-IDF (raw term counts without the inverse-document-frequency weighting), and that TF-IDF should generally be preferred over it.
  • One hurdle I faced this week was a heavy load of coursework, as one of my summer classes was ending. I was originally scheduled to present my word embedding work on Monday 6/15, but thankfully Rohit and Sara allowed me to present on 6/16 instead.
  • I was also able to research and learn more about LSTM and BERT models, which work differently from the models we have discussed so far (a minimal LSTM sketch follows this list). I look forward to working with these in the next portion of the project.
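
As a companion to the LSTM reading above, here is a minimal PyTorch skeleton of an LSTM text classifier; the vocabulary size, dimensions, and number of classes are arbitrary assumptions for illustration, not values from our project.

```python
# Minimal LSTM text-classifier skeleton in PyTorch (all sizes are placeholders).
import torch
import torch.nn as nn

class LSTMClassifier(nn.Module):
    def __init__(self, vocab_size=10_000, embed_dim=100, hidden_dim=128, num_classes=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, num_classes)

    def forward(self, token_ids):
        embedded = self.embedding(token_ids)   # (batch, seq_len, embed_dim)
        _, (hidden, _) = self.lstm(embedded)   # hidden: (1, batch, hidden_dim)
        return self.fc(hidden[-1])             # class logits from the final hidden state

# Example forward pass on a dummy batch of 4 sequences of length 20
model = LSTMClassifier()
dummy_batch = torch.randint(0, 10_000, (4, 20))
print(model(dummy_batch).shape)  # torch.Size([4, 2])
```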

Goals for Next Week:

  • Work on the TF-IDF and Logistic Regression model on the combined dataset from the Flowster and Amazon teams.
  • Also play around with BERT models to learn how this method differs from the previous ones; a rough sketch of what that might look like follows this list.
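
As a rough starting point for the BERT goal above, the sketch below shows one way BERT sentence embeddings could stand in for TF-IDF vectors ahead of a Logistic Regression classifier; it assumes the Hugging Face transformers library and the bert-base-uncased checkpoint, neither of which is part of the toolset listed in this report, and the example texts are made up.

```python
# Sketch (assumed approach): dense BERT [CLS] embeddings instead of sparse TF-IDF vectors.
import torch
from transformers import AutoModel, AutoTokenizer
from sklearn.linear_model import LogisticRegression

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
bert = AutoModel.from_pretrained("bert-base-uncased")

def embed(texts):
    """Return one fixed-length [CLS] vector per input text."""
    inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        outputs = bert(**inputs)
    return outputs.last_hidden_state[:, 0, :].numpy()

# Placeholder examples only; the real data would come from the merged dataset.
train_texts = ["great product, works as advertised", "arrived broken and support never replied"]
train_labels = [1, 0]
clf = LogisticRegression(max_iter=1000).fit(embed(train_texts), train_labels)
```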

Concise Overview of Things Learned:

Technical Area:

  • TF-IDF
  • Logistic Regression
  • Hyperparameter Tuning

Tools:

  • sklearn API
  • GridSearchCV

Soft Skills:

  • Presentation over Digital Medium

Three Achievement Highlights:

  1. Built a TF-IDF & Logistic Regression model for the non-augmented merged dataset.
  2. Built a TF-IDF & Logistic Regression model for the augmented merged dataset.
  3. Tuned the C and max_iter hyperparameters using GridSearchCV from scikit-learn (see the sketch after this list).
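
The sketch below shows roughly how the TF-IDF + Logistic Regression pipeline and the GridSearchCV tuning over C and max_iter fit together; the grid values and variable names are illustrative assumptions, not the settings actually used on the merged dataset.

```python
# Sketch: TF-IDF + Logistic Regression pipeline tuned with GridSearchCV (illustrative grid).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline

pipeline = Pipeline([
    ("tfidf", TfidfVectorizer(stop_words="english")),
    ("clf", LogisticRegression()),
])

param_grid = {
    "clf__C": [0.01, 0.1, 1, 10],        # regularization strength
    "clf__max_iter": [100, 500, 1000],   # solver iteration cap
}

search = GridSearchCV(pipeline, param_grid, cv=5, scoring="accuracy", n_jobs=-1)
# search.fit(train_texts, train_labels)  # train_texts/train_labels: placeholders for the merged dataset
# print(search.best_params_, search.best_score_)
```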

Meetings Attended:

  • Training the models results 6/29
  • Updates with Evaluating the Models 7/01
  • Preparing the Results Presentation 7/06

Tasks Completed:

  • Built TF-IDF & Logistic Regression classifiers that ran on the merged dataset and the augmented dataset.
  • For both models, I tuned the C and max_iter parameters of logistic regression to try to optimize the results.
  • I also contributed to the final presentation with the parts I completed for the project.
  • One hurdle I overcame was that hyperparameter tuning took a very long time to run; depending on how much I varied the parameters, training could take hours. The Google Colab runtime kept disconnecting due to inactivity before the model finished training. To solve this, I found a script online that, when pasted into the browser console, imitates a user clicking on the page every couple of minutes, which allowed the program to keep running.