Name: Yutong Wang
Team: ML Team 3 (Predictors)
Overview of Things Learned:
Technical Area: I researched data annotation tools available for both enterprise and personal use, and wrote a report on one of them. For the machine learning part, I built a web crawler for https://ai.stackexchange.com/ using the Scrapy library to collect question titles, tags, individual question links, and question content.
Tools Used: Python, Pandas, JSON, Scrapy, Git, Colab, Visual Studio Code
Soft Skills: Communicated with the team leader, tech leader, and teammates to resolve questions and points of confusion.
Achievement Highlights
- Successfully scraped the ai.stackexchange.com website using Scrapy
- Researched and wrote a report on a data annotation tool called Scale
List of Meetings Attended
- All team meetings except the first one
- Watched the recordings of the other webinars
Goals for the Upcoming Week
Build and train a model on the scraped data.
Tasks Done
- Task: Successfully scraped the ai.stackexchange.com website using Scrapy. Hurdles: I had trouble getting Scrapy to crawl at first, and the Stack Exchange website blocked access once a single IP address had sent a high number of requests.
- Task: Researched and wrote a report on a data annotation tool called Scale. Hurdles: The concept of data annotation tools was very new to me.
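One common way to reduce the chance of the IP block mentioned in the first hurdle is to slow the crawler down with Scrapy's built-in throttling settings. The sketch below uses real Scrapy setting names, but the dictionary name `POLITE_SETTINGS` and every value in it are assumptions for illustration, not the configuration used in this project, and they do not guarantee that a site will not block the crawler.

```python
# Hypothetical throttling configuration; tune the values to what the site tolerates.
POLITE_SETTINGS = {
    # Respect the site's robots.txt rules.
    "ROBOTSTXT_OBEY": True,
    # Wait about 2 seconds between requests to the same site.
    "DOWNLOAD_DELAY": 2.0,
    # Vary the delay randomly around DOWNLOAD_DELAY instead of using a fixed gap.
    "RANDOMIZE_DOWNLOAD_DELAY": True,
    # Send only one request at a time to each domain.
    "CONCURRENT_REQUESTS_PER_DOMAIN": 1,
    # Let Scrapy adapt the delay to the server's response times.
    "AUTOTHROTTLE_ENABLED": True,
    "AUTOTHROTTLE_START_DELAY": 2.0,
    "AUTOTHROTTLE_MAX_DELAY": 30.0,
    "AUTOTHROTTLE_TARGET_CONCURRENCY": 1.0,
    # Cache responses locally so repeated runs do not re-request the same pages.
    "HTTPCACHE_ENABLED": True,
}
```

A dictionary like this can be assigned to a spider's `custom_settings` class attribute, or the same keys can be set in the project's `settings.py`.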