热线电话:13121318867

登录
2020-07-24 阅读量: 956
​Find The Top 15 Most Popular Movies In 2020

Find The Top 15 Most Popular Movies In 2020

Description:

This challenge is to obtain the top-rated in theater movie from rotten tomatoes - In Theaters | Top Box Office.

Challenge instruction:

● web scrape the data from the following address: rotten tomatoes - In Theaters | Top Box Office

● The output data should only contain the following columns: title: string, tomatoScore: numeric, rank: numeric (the position based on tomato score, e.g., if we have 3 scores like 100, 100, 98, the corresponding rank is 1, 1, 3).

● The rank is based on the order of tomato score (descending) then movie title (ascending).

● we expect 40 records with tomato scores, but this could change at the time when you scrap the web page

● please upload your .ipynb or .py script and a .csv with 3 columns.

Introduction

Python has various libraries that can help you to do web scraping. Before starting, let's first think about how we should approach the problem. The idea to scrap a website should follow:

● check if someone already built something

● check if an API is publicly available or privately available

● last resort, go scrape on your own

Skills you will use:

● Web Scraping

● Work with API to get data

Results:

We expect 40 records with tomato scores, but this could change at the time when you scrape the web page

Submission:

Your submission should contain a .py or .ipynb file for your code and a .csv file for your result.

25.5616
1
关注作者
收藏
评论(0)

发表评论

暂无数据
推荐帖子