3 Messages

 • 

90 Points

Thursday, October 20th, 2022

Closed

Solved

Genres listed as "\N" within title.basics.tsv dataset

Hello,

After running in the title.basics.tsv dataset into RStudio, I noticed that all of the genre fields are null. Is this intentional, or is the data I downloaded somehow not correct? Not a single row has a genre listed, and i'm not sure if this is user error or not. I noticed on the dataset description it lists the genre as the only string array listed, so i'm not sure if that's it or not. Any help would be appreciated, thank you!

Oldest First
Selected Oldest First

2 Messages

 • 

70 Points

4 years ago

I am also encountering this error. Though technically, my coworkers are encountering this error, but somehow when I download the file from the same link, I have genres, but they don't. This is happening to 3 coworkers as of today.

2 Messages

 • 

70 Points

Some additional info:

- If I load the data directly into pandas with python using read_csv, I get the genres.

- BUT if I right-click on the link and Save Link As and download the file FIRST, then I do NOT get genres (and get all \N).

3 Messages

 • 

90 Points

So from the link itself, should I not be downloading the zip file from "title.basics.tsv.gz" and then unzipping it? How would I read the data directly into pandas without downloading the file first? I'm relatively new to pandas, but I know how to read_csv's and create a dataframe. Just not sure how to get the data directly without downloading it. 

Any advice would be appreciated! Thank you.

3 Messages

 • 

90 Points

Also, not sure if this is possible via this discussion post, but would you be able to send me the file with the genres included? I tried reading the data directly into a pandas DF from the url without downloading it, but I'm still turning up nothing.

2 Messages

 • 

70 Points

4 years ago

Hello!

I'd just like to report that the dataset called "title.basics.tsv.gz" does not contain the column "genres" anymore, since a couple of weeks.

The column is there, but the value is "\N" (null) for all the titles.

Is this a bug? Can it be solved somehow?

Thank you!

Riccardo

Note: This comment was created from a merged conversation originally titled Genres disappeared from title.basics dataset

Employee

 • 

5.6K Messages

 • 

58.9K Points

4 years ago

Hello all!

We have now reported this to our team in charge of datasets, I will let you know as soon as I have any updates.

Cheers!

Employee

 • 

5.6K Messages

 • 

58.9K Points

4 years ago

Hello everyone!

 

Issue was fixed.

 

Cheers!