13 Messages

 • 

504 Points

Tue, Jul 10, 2018 9:37 PM

Negligence of people is destroying photo galleries.

I just finished watching Blade Runner 2049 today and, as usual after writing my review, went to check the photo gallery so I could upload my own images in good resolution. I found 548 pictures already uploaded. That is an unusual number; normally a title has 50-100 pictures at most. I started checking them, and 80% either had horribly poor resolution (everything pixelated) or were duplicates of other pictures.
How do these pictures get accepted in the first place? Also, how is it that the reporting function is so hard to use? I have to select every image one by one and report it, just like adding images has to be done one by one. I think this process is very cumbersome. It's not normal that I have to spend one or two hours deleting other people's bad images when it should be intuitive and easy to do.

Responses

5.7K Messages

 • 

145.4K Points

2 years ago

Hi, Jasper. Thanks for bringing this to everybody-who-is-listening's attention. Could you maybe post URLs to all of the inferior-quality duplicates and group the URLs by identicality-of-content?

As for how the pictures get accepted in the first place, I'd imagine that the IMDb data editors do not even have a way to compare every new image upload to previously uploaded images. What they probably need, and what we all probably need, is an automated script that spots duplicate images in a collection of images and also generally "sorts" images. Hell, maybe one already exists, but that's currently more so up Google's alley than Amazon's. I will check to see if there are any GNU products or anonymous-access websites that would help.
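
To make the idea concrete, here is a minimal sketch of what such a duplicate-spotting script might look like. It uses an "average hash" fingerprint, which is one common technique and not something IMDb is known to use. It assumes each image has already been decoded into rows of grayscale values (0-255); the function names and the `max_distance` threshold are made up for illustration.

```python
from itertools import combinations


def average_hash(pixels, size=8):
    """Fingerprint a grayscale image (rows of 0-255 values) as size*size bits."""
    height, width = len(pixels), len(pixels[0])
    cells = []
    for gy in range(size):
        y0 = gy * height // size
        y1 = max(y0 + 1, (gy + 1) * height // size)
        for gx in range(size):
            x0 = gx * width // size
            x1 = max(x0 + 1, (gx + 1) * width // size)
            # Average brightness of one grid cell.
            block = [pixels[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            cells.append(sum(block) / len(block))
    mean = sum(cells) / len(cells)
    bits = 0
    for value in cells:
        # One bit per grid cell: 1 if the cell is brighter than average.
        bits = (bits << 1) | int(value > mean)
    return bits


def hamming_distance(a, b):
    """Count the bits that differ between two fingerprints."""
    return bin(a ^ b).count("1")


def find_near_duplicates(images, max_distance=5):
    """Return pairs of image names whose fingerprints nearly match.

    `images` maps a name (for example an "rm" key) to its pixel rows.
    `max_distance` is an arbitrary illustrative threshold.
    """
    hashes = {name: average_hash(pixels) for name, pixels in images.items()}
    return [
        (a, b)
        for a, b in combinations(hashes, 2)
        if hamming_distance(hashes[a], hashes[b]) <= max_distance
    ]
```

Because the fingerprint is computed on a coarse grid, a rescaled copy of the same frame tends to hash the same as the original. Comparing every pair is slow for huge collections, but it would be manageable for a gallery of a few hundred images.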

Let's see if we can convince the IMDb staff to en masse delete (most of) the inferior duplicates, but it is important to understand it will be up to us to at least present the URLs of the duplicates that need deleting.

13 Messages

 • 

504 Points

I already spent an hour or more reporting them one by one, but here are some examples I have compiled in 20 minutes on Blade Runner 2049:

https://www.imdb.com/title/tt1856101/mediaviewer/rm2040276736
https://www.imdb.com/title/tt1856101/mediaviewer/rm1201415936
https://www.imdb.com/title/tt1856101/mediaviewer/rm406553702
https://www.imdb.com/title/tt1856101/mediaviewer/rm2471701504
https://www.imdb.com/title/tt1856101/mediaviewer/rm601499904
https://www.imdb.com/title/tt1856101/mediaviewer/rm1953379584
https://www.imdb.com/title/tt1856101/mediaviewer/rm2677875712
https://www.imdb.com/title/tt1856101/mediaviewer/rm1659840256
https://www.imdb.com/title/tt1856101/mediaviewer/rm1493378560
https://www.imdb.com/title/tt1856101/mediaviewer/rm1560487424
https://www.imdb.com/title/tt1856101/mediaviewer/rm2372216320
https://www.imdb.com/title/tt1856101/mediaviewer/rm2388993536
https://www.imdb.com/title/tt1856101/mediaviewer/rm2741315072
https://www.imdb.com/title/tt1856101/mediaviewer/rm2791646720
https://www.imdb.com/title/tt1856101/mediaviewer/rm1399137792
https://www.imdb.com/title/tt1856101/mediaviewer/rm1415915008
https://www.imdb.com/title/tt1856101/mediaviewer/rm342173184
https://www.imdb.com/title/tt1856101/mediaviewer/rm358950400
https://www.imdb.com/title/tt1856101/mediaviewer/rm191178240
https://www.imdb.com/title/tt1856101/mediaviewer/rm23406080
https://www.imdb.com/title/tt1856101/mediaviewer/rm40183296
https://www.imdb.com/title/tt1856101/mediaviewer/rm644163072
https://www.imdb.com/title/tt1856101/mediaviewer/rm660940288
https://www.imdb.com/title/tt1856101/mediaviewer/rm4116981248
https://www.imdb.com/title/tt1856101/mediaviewer/rm4133758464
https://www.imdb.com/title/tt1856101/mediaviewer/rm2976130560
https://www.imdb.com/title/tt1856101/mediaviewer/rm3009684992
https://www.imdb.com/title/tt1856101/mediaviewer/rm1634940160
https://www.imdb.com/title/tt1856101/mediaviewer/rm2809017600
https://www.imdb.com/title/tt1856101/mediaviewer/rm1617835264
https://www.imdb.com/title/tt1856101/mediaviewer/rm3834000640
https://www.imdb.com/title/tt1856101/mediaviewer/rm1401304320
https://www.imdb.com/title/tt1856101/mediaviewer/rm2642752768

Poor Quality examples:
https://www.imdb.com/title/tt1856101/mediaviewer/rm3155120384
https://www.imdb.com/title/tt1856101/mediaviewer/rm3020902656

5.7K Messages

 • 

145.4K Points

Thanks, Jasper, but it would help if the duplicates were grouped (sorted) by similarity of content, such as particular images that basically were taken from the same time stamp (or film/digital frame number) of the movie's content. We would want to make sure that the best version of every screen capture is retained. I suppose we could take your word for it.

13 Messages

 • 

504 Points

I don't have electricity until tomorrow, and I can't do it on the phone. Please, if I forget, page me here so I remember to do it ASAP.

2.3K Messages

 • 

80.3K Points

Jeorj,

I am a little bit puzzled: what is the editors' job? Shouldn't this monitoring of duplicates and quality be part of their job description? I am keen to help IMDb maintain its content quality and meet its objectives, but this is an obvious case where they should spot and process the problem first. I am not keen on doing for free what paid employees are expected to do. Moreover, as mentioned, we have no tool to perform bulk processing, when I believe they do.

My 20 cents...

5.7K Messages

 • 

145.4K Points

Hi, Vincent Fournols. Do we even know how many IMDb data editors there are? The volume of contributions per day being so large is exactly why I even mentioned anything at all about automation. It would seem that mostly what the editors do is screen for content that crosses the decency line. I also emphasized "best version". I doubt the duplicate with the highest resolution and color depth would be immediately recognizable if it is presented as a thumbnail in the editors' interfaces. At this point, I don't believe the administrative user interface is significantly more advanced than the contribution interface for ordinary users.

2.3K Messages

 • 

80.3K Points

When it comes to picture quality, I am sure it can be assessed with quantitative data (picture sizes and resolutions). When it comes to duplicates, technology exists (see Facebook). So I derive that Amazon is not willing to invest that much in IMDb. Still puzzled...
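
As a rough illustration of assessing quality from picture sizes alone, here is a minimal sketch. The `GalleryImage` class, the 640x360 cutoff and the function names are all hypothetical, and pixel count is only a proxy: it says nothing about upscaled or heavily compressed files.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GalleryImage:
    rm_key: str   # identifier as seen in mediaviewer URLs
    width: int    # pixels
    height: int   # pixels

    @property
    def pixel_count(self):
        return self.width * self.height


def is_low_resolution(image, min_pixels=640 * 360):
    """Flag images below an illustrative (made-up) minimum pixel count."""
    return image.pixel_count < min_pixels


def split_keep_and_delete(duplicates):
    """Given images known to show the same content, keep the largest one."""
    ranked = sorted(duplicates, key=lambda img: img.pixel_count, reverse=True)
    return ranked[0], ranked[1:]
```

Given a group of duplicates that somebody has already matched up, `split_keep_and_delete` returns the highest-resolution copy to keep and the rest as candidates for deletion.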

Employee

 • 

3.6K Messages

 • 

76.5K Points

2 years ago

Hi all,

Thanks for reporting. We do try our best to catch duplicate image submissions; however, some do fall through the net, especially on a title as popular as Blade Runner 2049. Please go ahead and report any duplicates via the "Report this" function, and our data editors will be happy to take a look and go about removing them. A bulk image edit is not currently possible; however, you can add this as a new idea on Get Satisfaction, and other users can vote on the idea so it can be prioritized accordingly.

Regards,
Will

1.6K Messages

 • 

57.4K Points

Given the sheer number of pictures added to the database, it's impossible to check them all I think, but perhaps the system could be tweaked so that when a picture is added for a title that already has X pictures listed (say 500) a picture doesn't automatically get added and requires a human eye to look at it. Maybe that might help prevent dupes??
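
Spelled out as code, the rule suggested above could be as small as this. It is only a sketch of the suggestion, not of how IMDb's submission system actually works; the function names are hypothetical and the threshold of 500 is just the example figure from the post.

```python
def needs_manual_review(existing_image_count, threshold=500):
    """Return True when a new upload should wait for a human check."""
    return existing_image_count >= threshold


def route_upload(existing_image_count, threshold=500):
    """Pick a processing queue for a newly submitted picture."""
    if needs_manual_review(existing_image_count, threshold):
        return "manual-review"
    return "auto-accept"
```

With the Blade Runner 2049 gallery at 548 pictures, `route_upload(548)` would send every further upload to the manual queue.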

136 Messages

 • 

4.2K Points

2 years ago

I noticed that there are still many duplicates for Blade Runner 2049 so may I suggest these URLs for deletion? These are just examples. Many more to come if the below questions are answered. 
https://www.imdb.com/title/tt1856101/mediaviewer/rm1634940160
https://www.imdb.com/title/tt1856101/mediaviewer/rm2809017600
https://www.imdb.com/title/tt1856101/mediaviewer/rm3834000640
https://www.imdb.com/title/tt1856101/mediaviewer/rm1785935104
https://www.imdb.com/title/tt1856101/mediaviewer/rm1617835264
https://www.imdb.com/title/tt1856101/mediaviewer/rm1401304320
https://www.imdb.com/title/tt1856101/mediaviewer/rm2508010752
https://www.imdb.com/title/tt1856101/mediaviewer/rm1533949184
https://www.imdb.com/title/tt1856101/mediaviewer/rm3867555072
https://www.imdb.com/title/tt1856101/mediaviewer/rm3043898624
https://www.imdb.com/title/tt1856101/mediaviewer/rm443430144
https://www.imdb.com/title/tt1856101/mediaviewer/rm2221815040
https://www.imdb.com/title/tt1856101/mediaviewer/rm3783668992
https://www.imdb.com/title/tt1856101/mediaviewer/rm3801625856
https://www.imdb.com/title/tt1856101/mediaviewer/rm2709927168
https://www.imdb.com/title/tt1856101/mediaviewer/rm1567831296
https://www.imdb.com/title/tt1856101/mediaviewer/rm3847433216
https://www.imdb.com/title/tt1856101/mediaviewer/rm1100752640

Also what is the general rule when you have several images that are only a few frames different? Basically the same image but maybe a smile is slightly bigger in one? Or a hand is in a different gesture? In my opinion these tend to clutter up the galleries. For example:
https://www.imdb.com/title/tt1856101/mediaviewer/rm4083426816
https://www.imdb.com/title/tt1856101/mediaviewer/rm4116981248
https://www.imdb.com/title/tt1856101/mediaviewer/rm4133758464
or
https://www.imdb.com/title/tt1856101/mediaviewer/rm2741315072
https://www.imdb.com/title/tt1856101/mediaviewer/rm2791646720

The same question applies when movie or DVD posters are posted and only the text is different, like "coming this summer" vs "coming this July", or the European rating vs the American rating, while the images are otherwise the same. Having dozens of posters and DVD covers that are basically the same really clutters the galleries. For example:
https://www.imdb.com/title/tt1856101/mediaviewer/rm1659840256
https://www.imdb.com/title/tt1856101/mediaviewer/rm2677875712

Thanks for your feedback in advance.

13 Messages

 • 

504 Points

2 years ago

For some reason they haven't processed the ~150 reports I made yet

Employee

 • 

4.1K Messages

 • 

140K Points

Please see https://help.imdb.com/article/contribution/contribution-information/when-will-my-update-be-added/GEK...

Since images are covered by the "Track My Contribution" feature, simply find the email receipt for any unprocessed images and click the link "View the status of your contribution" to check the status. 

13 Messages

 • 

504 Points

2 years ago

All my duplicate reports were declined for this reason:

5.7K Messages

 • 

145.4K Points

Jasper, ah, you didn't bother to provide an explanation of which particular duplicate gallery item should be the only one to not be deleted! It's important to clarify that sort of thing, otherwise the data editor assigned to the case will just think your claim is not worth believing.

13 Messages

 • 

504 Points

I normally do, and I did for the first 25-30 reports, but after an hour and a half it got so boring that I just typed "duplicate" in them. That's what I mean: the main problem is people uploading images without knowing what's already been uploaded. The report function is almost neanderthal, and we're asked for too much information because of other people's wrongdoing.

5.7K Messages

 • 

145.4K Points

Jasper, you have to do it for all of the reports, to make sure that context is not lost in the event that an editor unfamiliar with the situation is assigned to deal with a given report submission.

13 Messages

 • 

504 Points



Here is a contribution that got declined even though I specifically said which item it was a duplicate of.

5.7K Messages

 • 

145.4K Points

(180710-212616-958602)

The IMDb data editors prefer it if the image is identified by its "rm" key.

Jasper, I can understand how frustrating this must be for you to find out after the fact, but now you know for future reference.
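
For anyone compiling reports from a list of links, here is a minimal sketch that pulls the "rm" keys out of pasted mediaviewer URLs. It assumes the URL shape seen in this thread (`/mediaviewer/rm` followed by digits); the function name is made up.

```python
import re

# Matches the "rm" key in URLs shaped like the ones posted in this thread.
RM_KEY = re.compile(r"/mediaviewer/(rm\d+)")


def extract_rm_keys(text):
    """Return the unique "rm" keys found in `text`, in first-seen order."""
    return list(dict.fromkeys(RM_KEY.findall(text)))


links = """
https://www.imdb.com/title/tt1856101/mediaviewer/rm2741315072
https://www.imdb.com/title/tt1856101/mediaviewer/rm2791646720
https://www.imdb.com/title/tt1856101/mediaviewer/rm2741315072
"""
print(extract_rm_keys(links))  # ['rm2741315072', 'rm2791646720']
```

The repeated link is dropped, so the same image does not get listed twice in a report.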

Employee

 • 

1.2K Messages

 • 

36K Points

2 years ago

Hi there,

Thanks for your posts and efforts to keep IMDb as up to date and accurate as possible.

Please remember, editors have to vet a large amount of information across many different data types on a daily basis. As mentioned earlier in the thread - given the tools used to handle image deletions paired with large image galleries, it can be hard for staff to identify if a duplicate image is eligible for removal. Jeorj made a great point that a URL added within the explanation field to the duplicate allows the editor to quickly reference the duplication and vet appropriately.

If you can submit these and continue to experience issues, please don't hesitate to let us know and we'll do our best to help out.

Joel

2.3K Messages

 • 

80.3K Points

Thanks Joel, that is all very nice, but this is 2018 now. There are scores of software tools that can identify duplicated photos, and the Chinese government is deploying a dynamic instant face recognition program. So maybe Amazon could invest just a little bit to keep IMDb abreast of its objective "to be the most comprehensive entertainment database", don't you think?

5.7K Messages

 • 

145.4K Points

Jeff Bezos' passion seems to be retail and related logistics, and Col Needham's passion seems to be documentation of cinema/television (and related logistics). It'll take some convincing to get them to adopt capabilities outside of those subjects that surpass those of the R&D arm of the public sector of the 1.4-billion-strong People's Republic of China. Haha. I know there are S3, Echo and Alexa, but those don't currently seem to be the "essence" of Amazon.