1 Message
•
100 Points
database report counting unique Actors, Directors, et cetera
I found the database statistics page, but I'm looking for information that would require a database query and report.
For example, of the ten million names on IMDb, how many have one or more Director credits? How many have one or more Actor credits?
I see 27 million Actor records, 17 million Actress records, 5 million Director records... but I assume that these count the same person multiple times. That has to be true, with 27 million actor records and only 10 million names. So, I think a lot of people would like to know how many unique Actors there are, without counting them multiple times for multiple movie appearances.
If there's already a page with these sorts of summary reports, please let me know! Otherwise, I think it would be good promotional material to highlight somewhere on IMDb.com (above the raw Statistics, which might be a bit too much detail for most people).
For example, of the ten million names on IMDb, how many have one or more Director credits? How many have one or more Actor credits?
I see 27 million Actor records, 17 million Actress records, 5 million Director records... but I assume that these count the same person multiple times. That has to be true, with 27 million actor records and only 10 million names. So, I think a lot of people would like to know how many unique Actors there are, without counting them multiple times for multiple movie appearances.
If there's already a page with these sorts of summary reports, please let me know! Otherwise, I think it would be good promotional material to highlight somewhere on IMDb.com (above the raw Statistics, which might be a bit too much detail for most people).
ljdoncel
Champion
•
1.1K Messages
•
50.9K Points
4 years ago
I agree that the statistics page ideally should give a lot more information, mainly for there is many more interesting results to be reported, but also to put in perspective the true (large) magnitude of the database. I hope developers improve this soon (...or eventually at least ).
In the meantime, you can gather summary reports yourself by looking at the plain text datasets. They contain a small subset of variables only, but sufficient to get listings like the one you're asking for.
I recently processed name.filmography.tsv (see below), so I can offer you some quite up-to-date numbers.
As of 07 Aug 2020:
TOTAL NAMES: 9,658,245 names
TOTAL CREDITS:
By CURRENT COUNTING SYSTEM (episodes count separately): 132,493,587 credits
If multiple jobs in the same category for a same title are weighted: 133,358,152 credits
By ALTERNATIVE COUNTING SYSTEM (episodes are consolidated): 50,655,246 credits
If multiple jobs in the same category for a same title are weighted: 51,509,562 credits
Cheers!
0