Scraping Smarter with Content Filtering

Scrapy is a powerful web scraping framework and essential tool for building machine learning datasets. When a site has a particularly simple structure, scrapy makes it easy to get a spider running to build up a curated dataset. Check out the tutorials in scrapy’s documentation. For example, to train a poster similarity model, we first needed to gather many movie posters. Consider trying to scrape IMDb.com.We may be interested in gathering posters from <img> tags under <div> tags with the class "poster"....

 · 3 min · Terry Rodriguez & Salma Mayorquin

Population Health Modeling

In a matter of months, the COVID-19 pandemic has besieged humanity and now the world wrestles to manage the population health challenges of a novel coronavirus with remarkable infectivity. Organizing an effective response to blunt the impact of such a large, complex challenge demands a principled and scientific approach. Better Planning by Forecasting Infections Reliable forecasting is crucial for planning and allocating limited resources efficiently and minimizing casualties. A most important characteristic of an infective virus is its average rate of reproduction or $R_0$....

 · 6 min · Terry Rodriguez & Salma Mayorquin