Find The Top 15 Most Popular Movies In 2020
Description:
This challenge is to obtain the top-rated in theater movie from rotten tomatoes - In Theaters | Top Box Office.
Challenge instruction:
● web scrape the data from the following address: rotten tomatoes - In Theaters | Top Box Office
● The output data should only contain the following columns: title: string, tomatoScore: numeric, rank: numeric (the position based on tomato score, e.g., if we have 3 scores like 100, 100, 98, the corresponding rank is 1, 1, 3).
● The rank is based on the order of tomato score (descending) then movie title (ascending).
● we expect 40 records with tomato scores, but this could change at the time when you scrap the web page
● please upload your .ipynb or .py script and a .csv with 3 columns.
Introduction
Python has various libraries that can help you to do web scraping. Before starting, let's first think about how we should approach the problem. The idea to scrap a website should follow:
● check if someone already built something
● check if an API is publicly available or privately available
● last resort, go scrape on your own
Skills you will use:
● Web Scraping
● Work with API to get data
Results:
We expect 40 records with tomato scores, but this could change at the time when you scrape the web page
Submission:
Your submission should contain a .py or .ipynb file for your code and a .csv file for your result.








暂无数据