
Champion • 5.1K Messages • 118.7K Points
finding librett*
I would like to identify every person on IMDb who's got a credit for librettist or libretto.
I downloaded the dataset title.crew.tsv, but I don't have software that can handle the volume of data.
LibreOffice loaded ~1,048k records, but I suspect that's a drop in the bucket.
OpenOffice couldn't even load the file at all.
Is there some other (free) software that can handle this job?
All I need to do is sort that file by name id, and search for "librett".
Unless someone who does this a lot can extract those "librett" records for me.
@ljdoncel , what software do you use?




Accepted Solution
ljdoncel • Champion • 1.3K Messages • 53.7K Points • 5 years ago
Hi, @bderoes :
You are correct that most spreadsheet programs and DBMSs can't handle the largest TSV files. I have noticed that even programs like SPSS, which in theory allow an unlimited number of records, cannot open the largest files directly without first shortening or pre-filtering them. To do that, I use a text editor capable of handling large files (EmEditor) where, depending on the query I'm working on, I do one (or both) of the following:
I performed a quick search and there are around 13,000 cases of librett* in the database:
https://jpst.it/2jjOY
Cheers!
ACT_1 • 8.8K Messages • 179.5K Points • 5 years ago
Maybe go to the folder that holds the dataset file in File Explorer and search there for the text "librettist" or "libretto"?
This may not be helpful ... :-(
- - -
Advanced Name Search
https://www.imdb.com/search/name/
No option for job title?
Biographies
https://www.imdb.com/search/name/?bio=librettist - 59 names.
https://www.imdb.com/search/name/?bio=libretto - 93 names.
jeorj_euler • 10.7K Messages • 226.1K Points • 5 years ago
What about GNU grep (the regular-expression search tool)?
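For anyone without grep at hand, here is a rough Python sketch of the same idea: read the file one line at a time, keep only the lines that contain "librett", and sort just those. Since only the matches are held in memory (roughly 13,000 according to the accepted answer), the size of the full file should not matter. The function names, the default `sort_column=0`, and the command-line usage are my own assumptions for illustration; check the header row of whichever dataset file you use to find the column that actually holds the name id.

```python
import sys


def matching_lines(lines, needle):
    """Yield each line that contains `needle`, ignoring case."""
    needle = needle.lower()
    for line in lines:
        if needle in line.lower():
            yield line


def sorted_matches(lines, needle, sort_column=0):
    """Collect the matching lines and sort them by one tab-separated column."""
    def key(line):
        fields = line.rstrip("\n").split("\t")
        # Lines with too few columns sort first instead of raising an error.
        return fields[sort_column] if sort_column < len(fields) else ""
    return sorted(matching_lines(lines, needle), key=key)


def extract(src_path, dst_path, needle="librett", sort_column=0):
    """Stream src_path and write its header plus the sorted matches to dst_path."""
    with open(src_path, encoding="utf-8") as src, \
         open(dst_path, "w", encoding="utf-8") as dst:
        header = next(src, "")  # first line is assumed to be the header row
        dst.write(header)
        hits = sorted_matches(src, needle, sort_column)
        dst.writelines(hits)
        return len(hits)


# Hypothetical usage: python extract_librett.py input.tsv output.tsv
if __name__ == "__main__" and len(sys.argv) == 3:
    count = extract(sys.argv[1], sys.argv[2])
    print(f"wrote {count} matching rows")
```

The output file should be small enough to open in LibreOffice. Note that the sort is a plain text comparison, so ids with different digit counts will not come out in strict numeric order, but rows for the same id still end up next to each other.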