Things Learned
-
Technical Area
→ Machine learning basic:
-
Difference between Programming and Machine learning
-
Define ML problems: Classification v.s. Regression
-
Types of variables: supervised learning v.s. Unsupervised Learning
-
Different filtering data method: content based filtering v.s. Collaborating filtering
-
Caculating similarity method: cos, Dot product eucidean distance
-
Useful model for data filtering: K-nearest neighbors
-
Evaluation ways for different output: MSE and Classification matrix
→ NLP
-
Pros and cons of : vanilla neural networks, RNNS and LSTM
-
Modern NLP: attention mechanisms and the transformer (Key, Query, Value)
-
Basic Process:
→ Encoder: take sentence to vectors
- BERT: Features, How it trained, components
→ Decoder: take vectors and weight to translate
- GPT2
-
Multihead attention:
- Core attention model
-
-
-
Tool
→ Prepare working environment:
Beautiful Soap, Selenium, Github, VS code
→ Have a basic idea of:
Pytorch, word enveddings, BERT, Fine Tuning BERT for Classification, Multi-Label Classification, Simple Transformers
-
Soft Skills
→ Managing a group
Keep members to participate by answer their question in time and efficiently
Track their progress using trello by checkboxs
For time conflict for meeting, record is always a good choice
→ Time management
Listing out personal task for each week, I also found trello can be used for self task and time management which is great
→ Problem Solving
Stem away provide many useful information and guide for solving recommendation system ML problems, it is always a good way to learn from others, videos and web resource when I meet problem
Three achievement highlights
-
Gathering all my team members
- Creating Slack and Whatsapp group
- Creating Trello task for members to track their weekly progress
- Held first welcome meeting with members and answer their questions
-
Successfully scraping data from Discourse Hub website
-
Have a hands on practice on NLP
Tasks Completed Process
-
Gathering members
- I posted one team set up forum and provide collective useful information for my members
- I sent slack and trello invitation to them by finding all members username
- After that I sent Whats app invitation using both their numbers on the post and sharing whatsapp group QR code on slack
Result
- Turned out working very well, we have only one person unable to connect and one person move to next session
- Members are very active on Whatsapp and Slack. Whatsapp is good for quick conversation and Slack is good for technology information sharing
-
Scraping data from Discourse Hub
- I followed steps on video to scraping data
Result
- Very useful tools and successfully get data.