1 Message
•
70 Points
Dupplicate titleId for movies
I've found some movies/series that appear to have multiple titleIds.
Examples:
tt2404467 -> tt2278388 for The Grand Budapest Hotel
tt12851524 -> tt11691774 for Only Murders in the Building
The first titleId gets redirected to the second titleIds when using the www.imdb.com web interface.
Could somebody explain what the reason for this duplicate ids are?
I use the titleIds to manage my own collection of movies and this duplication is causing some problems.
Also the redirecting titleIds are not included in https://datasets.imdbws.com/title.basics.tsv.gz
Would it be possible to release a new dataset that contains this mapping info?


gromit82
Champion
•
7.8K Messages
•
280.6K Points
2 years ago
p_g: It seems most likely that these titles were mistakenly entered into IMDb twice as separate entries, but when it was discovered that the film/TV series had been entered twice, the later entry (i.e. the one with the higher number in the title constant) was redirected to the earlier entry (the one with the lower number).
The official title constant is the one that doesn't get redirected (such as tt2278388 for The Grand Budapest Hotel), so I would recommend using that one for your personal collection.
I have no information about the datasets, so hopefully someone else will comment on that issue.
0