keyword_expert's profile

2.7K Messages

 • 

47K Points

Friday, October 28th, 2022 10:36 PM

Closed

Solved

Keyword Searches: IMDb has changed its display format for search results

As of a few minutes ago, the IMDb keyword search results changed significantly when I view in my browser. 

I will discuss the new display in this post, and will also ask three questions (in bold-face font below).

Below are a couple screenshots of what the search results look like now.

https://www.imdb.com/find?s=kw&q=argue+couple&ref_=nv_sr_sm

The search results are presented in a more "modern" format on a white background, with larger, bolded text for the keywords.

One big change is that the number of titles for each keyword is now presented on a separate row from the keyword itself. This will hurt my ability to post lists of duplicate keywords for mass mergers. Previously, I could easily copy and paste keywords to assemble lists like this:

arguing-couple (15 titles)  -->  couple's-argument (2 titles)  -->  argument-between-couple (75 titles)  -->   couple-argues (27 titles)

knife-held-to-someone's-throat (40 titles)  -->  knife-held-to-one's-throat (20 titles)  -->  holding-a-knife-at-someone's-throat (22 titles)  -->  holding-a-knife-to-someone's-throat (11 titles)  -->  hold-a-knife-to-someone's-throat (3 titles)  -->   knife-held-to-throat (444 titles) 

Using the new display, such a list would look something like this (matching the keywords in my first example shown above):

arguing couple

  • 15 titles

couple's argument

  • 2 titles

argument between couple

  • 77 titles

couple argues

  • 30 titles

I could manually edit such a list to present it in the old format, but that would take a lot of work. Which brings me to my first two questions.

1. Is it possible to opt out of the new display format for search results?

2. If not, could the new display show the number of titles in parentheses in the same row as each keyword, like it was before?

One other major change: for keyword searches on popular words, the old display would include up to 200 results. The new display only includes 25 results at a time, and you have to keep clicking "More popular matches" to show more. Here is an example link to show what I mean. Having to click that button 7 times to get the same 200 results as before is less than ideal. On the more positive side, if you click the button enough, you can display way more than 200 results. I tested it using this link and got up to 1,200 results, which is likely the limit. But that required clicking the "More" button 47 times. Which brings me to my third and final question.

3. Can the new display be modified to show up to 100 keyword results as the default (instead of 25 results)?

Displaying up to 100 results would really help with the usability/functionality of the new system. I would even settle for 50 results, but I would prefer 100 results. 25 results is way too small as a default, and will require a lot of button clicking just to get a good understanding of what is going on with popular words within keywords. 

This conversation has been merged. Please refer the main conversation:

Issues with the updated /find search results display

Employee

 • 

7.1K Messages

 • 

176.6K Points

2 years ago

@keyword_expert  Thanks for the feedback.  To answer your questions:

1. No sorry, the new experience is gradually being rolled out to increasingly larger numbers of customers.  Once it reaches 100% the old search software will be retired. 

2. No sorry, this is the new display format.  Since your keyword merge requests are manually reported and manually processed anyway, just do not worry about how they look.

We will report the result sizes issue to the appropriate team.   There’s a smaller initial result set so as not to overwhelm people with results when it is likely their desired result is in the first set anyway, however, once a customer presses a “more” link there’s less reason to keep adding results in only 25 item chunks.

3. See above :-)

Hope this helps.  

2.7K Messages

 • 

47K Points

@Col_Needham​ 

2. No sorry, this is the new display format.  

This new format also means a whole bunch of wasted white space in the middle of the screen. In that sense, it is similar to the new name page design, which has received criticism for similar reasons (too much wasted white space). There is definitely sufficient room in the middle of the screen to fit the numbers of titles for most if not all keywords of any length. I don't understand why IMDb would not want to use that wasted white space.

Also, because the new design uses two lines for each keyword (one line for the keyword in a bold font, the second line for the number of titles) that means that the results extend much further down the screen than necessary. 

Since your keyword merge requests are manually reported and manually processed anyway, just do not worry about how they look.

My concern is not so much how they look, but rather whether they will be formatted so that they are functional. I will probably now have to display groups of duplicate keywords vertically rather than horizontally. That could result in some very lengthy lists. 

There’s a smaller initial result set so as not to overwhelm people with results when it is likely their desired result is in the first set anyway, however, once a customer presses a “more” link there’s less reason to keep adding results in only 25 item chunks.

25 initial results is way too small for initial results for many searches.

How about displaying 50 results initially, and then an additional 100 results each time the "More" button is clicked? I could work with that. It would basically mean having to click the button twice to get back to what the old design showed by default. Plus, it would also allow for up to 1,200 results with 12 clicks of the button (instead of 47 clicks), which would be nice to have that many results when needed.

I would also love it if pushing the "Page-Down" button on a keyboard would extend the list (instead of having to click the "More" button each time), if that is possible.

Employee

 • 

7.1K Messages

 • 

176.6K Points

@keyword_expert​  As mentioned above,  we will look at the options for more results to be displayed when “more” is pressed, but there are good reasons for keeping the initial result set smaller.  Your idea on keyboard shortcuts is noted too, thanks. 

2.7K Messages

 • 

47K Points

@Col_Needham​ 

but there are good reasons for keeping the initial result set smaller.

And there are also good reasons for my asking for more than 25 initial results. (It used to be up to 200 initial results, which means the new display reduces the numbers by 87.5% for many initial results.)

I assume one of your reasons for wanting to keep the initial results smaller is to keep the length on the page shorter. If the numbers of titles were displayed on the same rows as the keywords, that would solve that issue.

I assume another reason is to reduce the amount of data that has to flow from IMDb to the user. This is understandable, but all I am asking for is a compromise of up to 50 initial results (a 75% reduction from the old version, rather than the current 87.5% reduction as discussed above).

Keep in mind that this would be "up to" 50 results. Many keyword searches will display less than 50 keywords (and often less than 25), anyway. But for the searches that would result in hundreds of results, it will help users to give an indication of that outcome by displaying the first 50 results (rather than just 25). 

(edited)

2.7K Messages

 • 

47K Points

2 years ago

I figured out a way to get back to the old display format -- at least for now. Simply logging out of my IMDb account, and then logging back in, works. 

Here are screenshots of the old display format, for comparison to the screenshots of the new display I posted above.

For the "knife throat" search, when comparing the old version to the new version, it's clear that not only is the format different, but the ordering of the search results is different, too. The old search was more likely to incorporate different variations of words (and similar words) into the results and display them earlier in the results, while the new search is more strict about searching for the exact words entered and displaying those first in the results.

I believe this is another reason why the initial results should display more than 25 results. With the new search format, if someone searches using a word like "decapitation," they won't see results like "decapitated-corpse" until much later than the first 25 results. I wish I could show screenshots of that for comparison purposes, but when I logged back in, I am now back to the old search results display (which I do prefer overall -- the only thing I like about the new display is the option of displaying up to 1,200 results). 

2.7K Messages

 • 

47K Points

Here are the "knife throat" screenshots back to back (old format, then new format), for comparison purposes.

(edited)

2.7K Messages

 • 

47K Points

I have now been forced back into the new display for search results. 

:(

I previously said:

I believe this is another reason why the initial results should display more than 25 results. With the new search format, if someone searches using a word like "decapitation," they won't see results like "decapitated-corpse" until much later than the first 25 results. I wish I could show screenshots of that for comparison purposes, but when I logged back in, I am now back to the old search results display (which I do prefer overall -- the only thing I like about the new display is the option of displaying up to 1,200 results). 

Now that I am seeing the new display, I tested this, and sure enough -- when I search for the word "decapitation," the keyword "decapitated-corpse" does not appear within the first 25 results. Rather, it doesn't appear until result #48. 

The old search results did a "smarter" job of retrieving keywords based not only on the closeness of the wording but also the prevalence of keyword usage on the database, and it even flipped the searched words to retrieve relevant/prevalent keywords that might use the same words in a different order.

The new results are much more strict about limiting the top 25 results to keywords that begin with the same words as searched in the same order, even if each keyword has only been used on 1 title.

This can be quite limiting and give one a sense of tunnel vision or a filter bubble (pick whatever analogy you want to use) -- especially when the system is only displaying 25 results on the first click (as opposed to the previous 200 results). 

And because the results in the new display take up a lot more space on the screen, it takes a lot of scrolling just to view the first 25 results. This is even further limiting.

With this new system, I fear in many cases people won't quickly find the best (most relevant) keywords, and will move on without even realizing it.

(edited)

2.7K Messages

 • 

47K Points

2 years ago

The one good thing I will say about the new keyword search display is that it now picks up newly created keywords within just a few minutes after the keywords are created.

Previously, the search system would only pick up newly created keywords once per week, on Sundays.

Now, they are picked up almost immediately. They appear as "0 titles" on the initial search results display, until the number of titles is updated on Sundays. 

But this is about the only good thing I can say about the new search. I still much prefer the old search, which displayed up to 200 results at a glance, gave the number of titles in the same row as the keyword, took up less space on the screen per keyword, was easier to read and use, and incorporated a "smarter" search engine. 

Employee

 • 

1.1K Messages

 • 

10.7K Points

Hello @keyword_expert,

As Mr Col said, your idea on keyboard shortcuts is noted. Thank you very much for the feedback. IMDb is always and will be trying to improve.

Have a nice day.

2.7K Messages

 • 

47K Points

2 years ago

Could somebody please take this thread off of "Solved" status? I don't regard most of the issues discussed herein as solved. 

2.7K Messages

 • 

47K Points

2 years ago

@Col_Needham @Jon 

I discovered some more problems with the new keyword search engine, involving "exact matches" of the keywords searched for. 

I will illustrate these problems using the exact keyword search "brother relationship" as an example.

Let's say I search for "brother relationship" because I am interested to see if that exact keyword exists, and also because I want to see if similar keywords exist.

Here is that search, and below is a screenshot of the current top 11 results: https://www.imdb.com/find?s=kw&q=brother+relationship

Note that the exact keyword I searched for, "brother relationship," does not appear until #10 in the results. 

And strangely, "brother brother brother relationship" appears as #1 in the results, despite its relatively low popularity compared to the very popular "brother brother relationship" (which appears as #2). 

Apparently the search engine assigns extra weight to the repeated prevalence of search words within the keywords themselves. This doesn't feel like the right way to power the search engine.

Here is how I would fix this. If the keyword searched for has an exact match in the current keyword system (referred to as "exact matches" (plural) in the new keyword search engine), I would always list that exact match as the #1 spot in the "popular matches" results. 

For example, for my sample search "brother relationship" there is currently an exact match keyword for "brother-relationship" so I would list that exact keyword first in the "popular" results.

As for spots #2, #3, etc., I would power the keyword search engine more like the old engine than the new engine.

The old engine "smartly" assigned extra weight to the prevalence/popularity of similarly worded keywords, placing extremely popular keywords higher in the results, above extremely unpopular keywords that might begin with the same exact wording.

In contrast, the new engine returns too many keywords that begin with the exact words searched for, without regard to popularity, thus returning too many keywords in the top 25 results that are currently only assigned to 1 title. 

And as previously discussed, the new engine for some reason gives extra weight to keywords that repeat the term searched for, in this case placing "brother-brother-brother-relationship" as #1 in the results. That makes no sense. 

The old engine also did a better job of flipping the order of the terms searched for, in order to provide more relevant and/or popular results higher in the list.

For example, if you do a keyword search for "child murder" the new engine does not display what you are (likely) actually searching for: the very popular keyword "murder-of-a-child," until #26 in the results (which, of course, is not within the first 25 results, which also involves the other serious problem I have tried to bring to your attention -- that 25 results is too few).

https://www.imdb.com/find?s=kw&q=child+murder&ref_=nv_sr_sm

Search "child murder"

Keywords

Again, "murder of a child" is #26 in the results. Thus, if you search for "child murder," you will literally not even see "murder of a child" in the so-called "popular" results returned by IMDb.  This is a problem.

I have to wonder whether IMDb is powering its new search engine with the exact same engine when searching for "popular" keywords as when searching for "popular" titles and celebrities. It does seem that way to me. This would be a mistake, since there are important differences with keywords.

For example, keywords are unique in that it is impossible to have exact duplicate keywords. There can be multiple movies named "Brother Relationship" and multiple people named "John Smith" but there can only be one keyword named "brother-relationship" and therefore only one exact match when you search for that keyword. The unique things about keywords really need to be factored in to the search engine.

I wish there were a way to access the old keyword search engine and test it side by side with the new engine to provide even better examples of why the old engine was overall superior to the new engine. Hopefully the issues discussed in this thread will give you a sense of some of the many problems with the new engine, which does not appear to have been properly vetted.

2.7K Messages

 • 

47K Points

2 years ago

Here is another example to illustrate one of the problems with the new search engine for keywords.

When I do a keyword search for "killed," popular keywords like police officer killed, returning character killed off, actual animal killed, characters killed one by one, man killed, innocent person killed, shot and killed, etc. do not start to appear until result #411.

That's because the new search engine first returns all keywords that literally begin with the word "killed," including hundreds of keywords currently only assigned to 1, 2, or 3 titles.

And yet this new search engine purports to be a search for "popular matches." Obviously that is not true. 

I am pretty confident in saying that the old search engine would have done a better job of returning these actually popular keywords (bolded below) higher in the list of results. 

Search "killed"

Keywords

Exact matches

(edited)

2.7K Messages

 • 

47K Points

2 years ago

Here is another example search to illustrate one of the problems with the new keyword search. When I search for "brother sister relationship" that exact match is listed #2 in the list, after a much less popular keyword.

If there is an exact match for any keyword, that exact match should always be given the #1 spot in the results. But this concept is limited to exact matches (of which there will always be only 1 or 0 for any given keyword search, since every keyword is unique). 

Search "brother sister relationship"

Keywords

Employee

 • 

729 Messages

 • 

8K Points

@keyword_expert​ 

We appreciate you taking the time to share your feedback. We are always working on ways to improve IMDb and I was sure to share this feedback with the team.

IMDb.com is constantly being updated and improved, and we welcome all comments and suggestions aimed at improving its features, flexibility and ease of use.

Thanks

2.7K Messages

 • 

47K Points

2 years ago

As further proof that there is something really "off" about the new keyword search engine, here is another example.

When you search for "one day time span" (with a space in "time span") the very popular and nearly identical result "one day timespan" is literally the last keyword listed in the search results. And because the new display takes up two rows per entry instead of one row, when viewing on most computer screens, that last search result won't even be visible unless you scroll down. 

Search "one day time span"

Keywords

Exact matches

(edited)

2.7K Messages

 • 

47K Points

2 years ago

Here is another weird situation illustrating the shortfalls of the new keyword search algorithm.

Why is the exact item searched for not listed until the fifth result?

Search "reference to laura palmer"

Keywords

Exact matches

10.6K Messages

 • 

224.9K Points

2 years ago

The following is a bookmarklet (or script code prefixed with a out-syntax scheme identifier) which, when used on an IMDb keyword search results page, will convert the format to the way effectively it used to be.


(
  function(p,m,e,i,n,t)
  {
    for(n=e.length,i=0;i<n;i++)
    {
      e[i].style.display="inline";
      t=e[i][m]("a"),
      t[p]=t[p].replace(/ /g,"-");
      if(t=e[i][m]("ul li label"))
        e[i][m]("ul").outerHTML=" ("+t[p]+")"
    }
  }
)
("innerHTML","querySelector",document.querySelectorAll(".ipc-metadata-list-summary-item__tc"))

If you know what a bookmarklet is, then you already know what to do, otherwise to "install" the tool, you would highlight the aforementioned source code (a JavaScript URI), copy it (to the clipboard) and paste it into URL field of a browser bookmark, whether already existing or newly created.

To use a bookmarklet, you "visit" it while having the particular webpage (in all of the Web) as the active tab in your browser. You don't need to understand JavaScript or any scripting language to use a bookmarklet.

The properly-encoded URI for that same source code is (function(p,m,e,i,n,t){for(n=e.length,i=0;i<n;i++){e[i].style.display="inline";t=e[i][m]("a"),t[p]=t[p].replace(/%20/g,"-");if(t=e[i][m]("ul%20li%20label"))e[i][m]("ul").outerHTML="%20("+t[p]+")"}})("innerHTML","querySelector",document.querySelectorAll(".ipc-metadata-list-summary-item__tc")); by the way. I previously presented this with line breaks and indentation to make it easier to read and thereby easier to understand, which in turn helps a semi-experienced individual screen it for malicious script behavior and errors in the flow of control that could unintentionally cause problems with resource allocation on any computer on which it is executed, meaning that trusting me would not be requisite for understanding that the script is safe.

I don't have a script for generating a longer or comprehensive listing of keywords in a search results page, as of this time.

10.6K Messages

 • 

224.9K Points

Alas, it would appear that the forum software is automatically filtering out any instance of a colon immediately preceded by the word "javascript", which is how the text of all bookmarklets begin:

This didn't used to happen on here (e.g. https://community-imdb.sprinklr.com/conversations/imdbcom/reviews-interface-improvement-idea/5f4a79e78815453dba8f4034?commentId=5f4a7b6f8815453dbad85109). Apparently somebody on board software management team believes that censorship upon this kind of stuff improves security or prevents code injection. Ridiculous!

2.7K Messages

 • 

47K Points

@jeorj_euler​ 

I just successfully installed your bookmarklet. Thank you, Thank you, Thank you!!

You don't even know how much it means to me to get the old format back for keyword searches (results displayed one per line, with the number of titles in parentheses). This will solve a major problem that I was having in trying to figure out how to present my duplicate keyword lists in the future. Now I can keep presenting them in the same way as I always have -- a tried and true proven format -- with minimal extra effort.

You have given me (and the entire IMDb community) an important gift this Christmas season. Thank you!!!

2.7K Messages

 • 

47K Points

@jeorj_euler​ p.s. And you even added the hyphens back into the keywords as displayed. Awesome! I appreciate your attention to detail.

10.6K Messages

 • 

224.9K Points

I hadn't been paying close attention all along (for the past year or so). When I realized the gravity of the situation, I had to share a workaround. You, keyword expert, seem the be the only person around here who is really trying to promote quality control in the keywords assignments system.

2.7K Messages

 • 

47K Points

2 years ago

Another example: the exact keyword searched for, "reference-to-christ," does not appear in the results until result #48.

Search "reference to christ"

Keywords

Exact matches

10.6K Messages

 • 

224.9K Points

2 years ago

The following is a bookmarklet (or script code prefixed with a out-syntax scheme identifier) which, when used on an IMDb keyword search results page, will convert the format to the way effectively it used to be.

javascript:
(
  function(p,m,e,i,n,t)
  {
    for(n=e.length,i=0;i<n;i++)
    {
      e[i].style.display="inline";
      t=e[i][m]("a"),
      t[p]=t[p].replace(/ /g,"-");
      if(t=e[i][m]("ul li label"))
        e[i][m]("ul").outerHTML=" ("+t[p]+")"
    }
  }
)
("innerHTML","querySelector",document.querySelectorAll(".ipc-metadata-list-summary-item__tc"))

If you know what a bookmarklet is, then you already know what to do, otherwise to "install" the tool, you would highlight the aforementioned source code (a JavaScript URI), copy it (to the clipboard) and paste it into URL field of a browser bookmark, whether already existing or newly created.

To use a bookmarklet, you "visit" it while having the particular webpage (in all of the Web) as the active tab in your browser. You don't need to understand JavaScript or any scripting language to use a bookmarklet.

The properly-encoded URI for that same source code is javascript:(function(p,m,e,i,n,t){for(n=e.length,i=0;i<n;i++){e[i].style.display="inline";t=e[i][m]("a"),t[p]=t[p].replace(/%20/g,"-");if(t=e[i][m]("ul%20li%20label"))e[i][m]("ul").outerHTML="%20("+t[p]+")"}})("innerHTML","querySelector",document.querySelectorAll(".ipc-metadata-list-summary-item__tc")); by the way. I previously presented this with line breaks and indentation to make it easier to read and thereby easier to understand, which in turn helps a semi-experienced individual screen it for malicious script behavior and errors in the flow of control that could unintentionally cause problems with resource allocation on any computer on which it is executed, meaning that trusting me would not be requisite for understanding that the script is safe.

I don't have a script for generating a longer or comprehensive listing of keywords in a search results page, as of this time.

10.6K Messages

 • 

224.9K Points

2 years ago

I just edited and deleted Sprinklr comment 63a8ffb88a78b24108718f80, so Sprinklr reply 63ae62c548a8017448664f11 by keyword_expert has become invisible to everybody, even though a record of it has been kept and still shows up in my notifications. I'm noticing some glitches in the way that the forum software filters the contents of posts. For instance, whenever the words "top", "bottom", "left", "right", or "display" are followed by a colon and a quotation mark, the filtering system will remove the whole word, the colon and that quotation mark, along with any white space characters between the word and symbols. The aforementioned words are Cascading Style Sheet (CSS) description properties, and may appear in HTML "style" tags or in the HTML "style" attributes of respective HTML element tags.

10.6K Messages

 • 

224.9K Points

As a note to all, whenever deleting a post on this forum, try to remember to edit it first in such a way as to replace it with only five random non-space characters or a message of your choice that meets the minimum character limit. I don't know if this applies to everybody on here, but whenever I create a post that consists only of five fullstop marks (or low dots), it is automatically flagged as private pending the approval of the board administrator. I suspect this may also occur if the posts consists only of five question marks, five exclamation marks, five colons, five semicolons or five of any other symbol that might constitute gibberish when presented in such a way.

10.6K Messages

 • 

224.9K Points

Quoting a hidden post:

@jeorj_euler Shouldn't you also tell people to add j@v@script: at the beginning (spelling it out with the alphabet)?

I suppose my quick answer to that is to explain that I shall wait to see if one my more recent specialized rephrasing of the earlier source code post is approved for publication. I may have to subject the forurm software to a battery of tests, in the mean time.

2.7K Messages

 • 

47K Points

@jeorj_euler​ 

Thank you again for creating the bookmarklet. I am already putting it to good use.

10.6K Messages

 • 

224.9K Points

I see that Sprinklr comment 63ae636048a8017448664f19 was approved for publication, so I may want to delete the earlier one in which the protocol part of the uniform resource identifier is missing due to the automated filter.

2.7K Messages

 • 

47K Points

2 years ago

Some new changes have been made very recently to the keyword search engine that now place popular similar keywords at the top of the list. 

For example, in my post of 21 days ago, I reported that the search for the word "killed" did not return "police-officer-killed" and similar keywords until result #411. Today when I do the same search, "police-officer-killed" is literally at the top of the list, ranked at #1. 

Search "killed"

Keywords

Exact matches

Another change I have noticed is that when you click the "exact matches" button on keyword searches, it will now sometimes return multiple results (not just one result), because it now searches for exact matches without regard to spaces and punctuation.

For example, a search for "videoconferencing" returns both video conferencing and videoconferencing as exact matches.

A search for "voicemail" returns both voice mail and voicemail as exact matches.

A search for "esp" returns both esp and e.s.p. as exact matches.

All of these are good improvements. 

However, back to the first change (placing similar popular keywords at the very top of the list), I actually think this change has gone too far in that direction. 

I would still recommend always placing exact matches at the very top of the search results list regardless of popularity (especially since we are only talking about one or two search results).

And I would still recommend placing keywords that are extremely close in wording (without regard to popularity) to be placed next, after the exact matches.

Then the popular similar results should be mixed in with the results that begin with the same words. 

An appropriate balance should be struck.  The new engine is closer to finding that balance, but it hasn't quite been found yet.

To illustrate my point, consider the following search. Currently, when I search for "trapped in a moving car," the keyword trapped-in-a-driverless-moving-car does not appear until result #35 (and because the results still only display 25 initial results as default, this keyword is not even visible at first). "trapped in a driverless moving car" is so close to "trapped in a moving car" that I would expect it to be higher in the search results. (And I also still think that initial results should include 50 keywords rather than 25 keywords).

Search "trapped in a moving car"

Keywords

Anyway, thank you for the ongoing improvements to the keywords search engine. It is not quite back to where it used to be, but progress is being made.

(edited)

Employee

 • 

27 Messages

 • 

314 Points

@keyword_expert​ -- appreciate all your keyword search-related feature feedback. The team is prioritizing keyword search updates and experiments, and I have shared your additional feature and design feedback for consideration. Thanks, and please reach out at any time.