from datetime import datetime
from pathlib import Path
import pandas as pd
base_path = Path('/kaggle/input/the-movies-dataset')
credits = pd.read_csv(base_path/'credits.csv')
keywords = pd.read_csv(base_path/'keywords.csv')
movies = pd.read_csv(base_path/'movies_metadata.csv').\
    drop(['belongs_to_collection', 'homepage', 'imdb_id', 'poster_path', 'status', 'title', 'video'], axis=1).\
    drop([19730, 29503, 35587]) # Incorrect data type
movies['id'] = movies['id'].astype('int64')
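The three row labels dropped above are hard-coded for this particular CSV. A minimal sketch of locating such rows programmatically, assuming the problem is an `id` value that is not numeric (the `toy` frame and its values are invented for illustration, not taken from the dataset):

```python
import pandas as pd

# Toy stand-in for movies_metadata.csv; values are hypothetical.
toy = pd.DataFrame({'id': ['862', '1997-08-20', '8844'],
                    'original_title': ['A', 'B', 'C']})

# Coerce to numbers; anything that cannot be parsed becomes NaN.
numeric_id = pd.to_numeric(toy['id'], errors='coerce')

# Row labels whose id could not be parsed.
bad_rows = toy.index[numeric_id.isna()].tolist()

# Drop those rows, then store the id as int64.
clean = toy.drop(bad_rows)
clean['id'] = numeric_id.drop(bad_rows).astype('int64')
```

This avoids depending on fixed row positions, at the cost of one extra pass over the column.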
df = movies.merge(keywords, on='id').\
    merge(credits, on='id')
df['original_language'] = df['original_language'].fillna('')
df['runtime'] = df['runtime'].fillna(0)
df['tagline'] = df['tagline'].fillna('')
df.dropna(inplace=True)
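The per-column fills followed by `dropna` can also be written with a single `fillna` call that takes a column-to-value mapping. A small sketch on an invented `toy` frame (only the column names come from the cells above):

```python
import pandas as pd

# Toy frame with hypothetical values; column names follow the cells above.
toy = pd.DataFrame({'original_language': ['en', None],
                    'runtime': [81.0, None],
                    'tagline': [None, 'Roll the dice']})

# One fillna call with a per-column mapping replaces the three assignments.
toy = toy.fillna({'original_language': '', 'runtime': 0, 'tagline': ''})

# Any row still missing a value in some other column would be dropped here.
toy = toy.dropna()
```

Columns that are not listed in the mapping are left untouched by `fillna`, so `dropna` still removes rows with gaps elsewhere.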
ratings_df = pd.read_csv(base_path/'ratings_small.csv')
ratings_df['date'] = ratings_df['timestamp'].apply(lambda x: datetime.fromtimestamp(x))
ratings_df.drop('timestamp', axis=1, inplace=True)
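The row-by-row `apply` above can be replaced with a vectorized conversion. A sketch, assuming the `timestamp` column holds Unix epoch seconds (the `toy_ratings` frame and its values are made up):

```python
import pandas as pd

# Toy ratings with hypothetical epoch-second timestamps.
toy_ratings = pd.DataFrame({'userId': [1, 4], 'timestamp': [0, 86400]})

# Convert the whole column at once; unit='s' treats the values as seconds since the epoch.
toy_ratings['date'] = pd.to_datetime(toy_ratings['timestamp'], unit='s')

# Drop the raw column without mutating in place.
toy_ratings = toy_ratings.drop(columns='timestamp')
```

One difference to keep in mind: `pd.to_datetime(..., unit='s')` interprets the values as UTC, whereas `datetime.fromtimestamp` uses the local timezone of the machine, so the two can differ by the UTC offset.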
ratings_df = ratings_df.merge(df[['id', 'original_title', 'genres', 'overview']], left_on='movieId',right_on='id', how='left')
ratings_df = ratings_df[~ratings_df['id'].isna()]
ratings_df.drop('id', axis=1, inplace=True)
ratings_df.reset_index(drop=True, inplace=True)
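A left join followed by discarding the unmatched rows keeps the same rows as an inner join. The sketch below compares the two on invented data (`ratings` and `meta` are hypothetical stand-ins for `ratings_df` and `df`):

```python
import pandas as pd

# Hypothetical ratings; movieId 99 has no metadata.
ratings = pd.DataFrame({'userId': [1, 1, 2],
                        'movieId': [10, 99, 10],
                        'rating': [2.5, 4.0, 3.0]})
meta = pd.DataFrame({'id': [10, 20], 'original_title': ['Toy A', 'Toy B']})

# Left join, then discard ratings that found no metadata (as in the cell above).
left = ratings.merge(meta, left_on='movieId', right_on='id', how='left')
left = left[~left['id'].isna()].drop(columns='id').reset_index(drop=True)

# Inner join keeps only the matching rows in one step.
inner = ratings.merge(meta, left_on='movieId', right_on='id', how='inner')
inner = inner.drop(columns='id')
```

The left-join form is useful when the unmatched ratings should be inspected or counted before they are removed.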
ratings_df.head()
# rename() returns a new frame, which avoids an in-place rename on a slice of df
movies_df = df[['id', 'original_title']].rename(columns={'id': 'movieId'})
ratings_df.merge(movies_df)
| | userId | movieId | rating | date | original_title | genres | overview |
---|---|---|---|---|---|---|---|
0 | 1 | 1371 | 2.5 | 2009-12-14 02:52:15 | Rocky III | [{'id': 18, 'name': 'Drama'}] | Now the world champion, Rocky Balboa is living in luxury and only fighting opponents who pose no threat to him in the ring. His lifestyle of wealth and idleness is shaken when a powerful young fighter known as Clubber Lang challenges him to a bout. After taking a pounding from Lang, the humbled champ turns to former bitter rival Apollo Creed to help him regain his form for a rematch with Lang. |
1 | 4 | 1371 | 4.0 | 2000-02-06 04:11:42 | Rocky III | [{'id': 18, 'name': 'Drama'}] | Now the world champion, Rocky Balboa is living in luxury and only fighting opponents who pose no threat to him in the ring. His lifestyle of wealth and idleness is shaken when a powerful young fighter known as Clubber Lang challenges him to a bout. After taking a pounding from Lang, the humbled champ turns to former bitter rival Apollo Creed to help him regain his form for a rematch with Lang. |
2 | 7 | 1371 | 3.0 | 1996-12-29 14:19:20 | Rocky III | [{'id': 18, 'name': 'Drama'}] | Now the world champion, Rocky Balboa is living in luxury and only fighting opponents who pose no threat to him in the ring. His lifestyle of wealth and idleness is shaken when a powerful young fighter known as Clubber Lang challenges him to a bout. After taking a pounding from Lang, the humbled champ turns to former bitter rival Apollo Creed to help him regain his form for a rematch with Lang. |
3 | 19 | 1371 | 4.0 | 1997-02-06 01:43:24 | Rocky III | [{'id': 18, 'name': 'Drama'}] | Now the world champion, Rocky Balboa is living in luxury and only fighting opponents who pose no threat to him in the ring. His lifestyle of wealth and idleness is shaken when a powerful young fighter known as Clubber Lang challenges him to a bout. After taking a pounding from Lang, the humbled champ turns to former bitter rival Apollo Creed to help him regain his form for a rematch with Lang. |
4 | 21 | 1371 | 3.0 | 1997-01-21 13:11:03 | Rocky III | [{'id': 18, 'name': 'Drama'}] | Now the world champion, Rocky Balboa is living in luxury and only fighting opponents who pose no threat to him in the ring. His lifestyle of wealth and idleness is shaken when a powerful young fighter known as Clubber Lang challenges him to a bout. After taking a pounding from Lang, the humbled champ turns to former bitter rival Apollo Creed to help him regain his form for a rematch with Lang. |
... | ... | ... | ... | ... | ... | ... | ... |
45184 | 652 | 129009 | 4.0 | 2015-09-19 19:27:07 | Love Is a Ball | [{'id': 35, 'name': 'Comedy'}, {'id': 10749, 'name': 'Romance'}] | Etienne makes a good living out of marrying off poor but titled young men to rich but untitled young ladies. Millicent is now in his sights on the Riviera, and Grand Duke Gaspar is the bait. But what if Millicent starts to fancy planted chauffeur John instead, and Gaspar takes a shine to Etienne's secretary Janine? |
45185 | 653 | 2103 | 3.0 | 2000-01-18 02:04:26 | Solaris | [{'id': 18, 'name': 'Drama'}, {'id': 878, 'name': 'Science Fiction'}, {'id': 9648, 'name': 'Mystery'}, {'id': 10749, 'name': 'Romance'}] | Upon arrival at the space station orbiting an ocean world called Solaris a psychologist discovers that the commander of an expedition to the planet has died mysteriously. Other strange events soon start happening as well, such as the appearance of old acquaintances of the crew, including some who are dead. |
45186 | 659 | 167 | 4.0 | 1996-06-30 12:25:50 | K-PAX | [{'id': 18, 'name': 'Drama'}, {'id': 878, 'name': 'Science Fiction'}] | Prot is a patient at a mental hospital who claims to be from a far away Planet. His psychiatrist tries to help him, only to begin to doubt his own explanations. |
45187 | 659 | 563 | 3.0 | 1996-06-13 19:29:47 | Starship Troopers | [{'id': 12, 'name': 'Adventure'}, {'id': 28, 'name': 'Action'}, {'id': 53, 'name': 'Thriller'}, {'id': 878, 'name': 'Science Fiction'}] | Set in the future, the story follows a young soldier named Johnny Rico and his exploits in the Mobile Infantry. Rico's military career progresses from recruit to non-commissioned officer and finally to officer against the backdrop of an interstellar war between mankind and an arachnoid species known as "the Bugs". |
45188 | 665 | 129 | 3.0 | 2001-07-15 21:28:48 | 千と千尋の神隠し | [{'id': 14, 'name': 'Fantasy'}, {'id': 12, 'name': 'Adventure'}, {'id': 16, 'name': 'Animation'}, {'id': 10751, 'name': 'Family'}] | A ten year old girl who wanders away from her parents along a path that leads to a world ruled by strange and unusual monster-like animals. Her parents have been changed into pigs along with others inside a bathhouse full of these creatures. Will she ever see the world how it once was? |
45189 rows × 7 columns
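Because the data comes from a CSV, the `genres` column holds the string representation of a list of dictionaries, not the list itself. A minimal sketch of extracting the genre names with `ast.literal_eval`, using two values copied from the table above (`genre_names` is a hypothetical helper, not part of the original notebook):

```python
import ast
import pandas as pd

# Two genres strings as they appear in the output above.
toy = pd.DataFrame({'genres': [
    "[{'id': 18, 'name': 'Drama'}]",
    "[{'id': 35, 'name': 'Comedy'}, {'id': 10749, 'name': 'Romance'}]",
]})

def genre_names(raw):
    """Parse the Python-literal string and return only the genre names."""
    return [genre['name'] for genre in ast.literal_eval(raw)]

toy['genre_names'] = toy['genres'].apply(genre_names)
```

`ast.literal_eval` only evaluates Python literals, so it is a safer choice than `eval` for strings read from a file.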