Employee
•
500 Messages
•
42.2K Points
New rule regarding the display of Chinese and some other Asian names
Until recently, our name formatting rules included a paragraph outlining a requirement to submit Chinese names in Western format (i.e. “family name, first name”). For example, actor Chow Yun-Fat was listed on the site as Yun-Fat Chow (because "Yun-Fat" is the family name)
We have now removed this requirement. Effective immediately, when adding a new name or correcting an existing one, please enter it in the same format it’s supposed to be shown (e.g. “Chow Yun-Fat”) and/or in the way preferred by the person to whom it refers (if known) and our submission system will change it so that it is properly stored and displayed.
jeorj_euler
10.7K Messages
•
225.4K Points
4 years ago
Unless I'm mistaken, I'm a little disappointed that a change to the formatting rules has been undertaken instead of the system being upgraded in such a way that ensures every person's the family name would be displayed before his or her given name, in cases where it would appropriate, even though the names are stored with the family name followed by a comma followed by the given name. As I understand it, this would require an additional data field (i.e. a column) in the filmography item data type, or an equivalent hack. Why not simply provide a way for contributors to override the aspect of the submission form related to the bit about "The following fixes have been applied automatically" (which happens whenever a comma is omitted from the name field)? On a side note, why not provide a way for contributors to input names that contain single-letter names?
2
samofnine
36 Messages
•
486 Points
4 years ago
Most asian/arabic/eastern-european people do not have an official english translation of their names, so there are many people listed on the IMDb that are the same people only under different spellings or one does not find the correct name. There should be a text field for names that allows contributors to add the original name signs/letters like an a.k.a.-section for those names and their pronounciations in different languages like it has been done with the movie titles (if there is no official translation from the person or movie title itself). With the original signs/letters a name or a movie title would be found better or sometimes even at all.
For example: Chow Yun-Fat's original name is 周潤發 - spoken it would be Zhōu Rùn Fā, so not quite the displayed name. But there has been established an international name by media/agents or the person to spell him Chow Yun-Fat. That would be his imdb display name.
Some people have the same original name like 周, but some are spelled Chow, some are spelled Zhou, so it is very difficult to find those people in the database. That's why I propose to establish a db-text field for the original name signs/letters.
2
English_pedantic_grammarian
78 Messages
•
1.5K Points
4 years ago
The “/Asian” is extremely vague as to what comes under this rule and what doesn’t.
What about Japanese names? English Wikipedia and The Movie Database put these by default with the family name last, except with pre-Meiji names and art names of performers of traditional Japanese arts. But the Japanese government is pushing for them to romanised in the order they are pronounced in Japanese (in which the family name is usually first) and some bodies with links to the government like the NHK, UniJapan (including with their JFDB, which is a good resource for finding out the order names are pronounced in Japanese) and various countries’ branches of the Japan Foundation follow this rule.
What about Hungarian names, in which the family name comes first, but which are not Asian?
What about mixed Asian/Western names, like Kang Daniel and Jackie Chan? These are most often written with the family name last, but not always: Kang Daniel is always romanised officially with the family name (Kang) first.
What about people with completely East Asian names but whom live or lived primarily in countries where family names normally come last, like Sessue Hayakawa, Gok Wan, Kazuo Ishiguro or Yoko Ono?
How should the second syllable of a two-syllable name separated by a hyphen be capitalised? Both English Wikipedia and The Movie Database specify that the second syllable should by default not have a capital letter (e.g. Chow Yun-fat), the only exceptions being when it can be proved that the capitalised form is preferred by the person, or is otherwise official or the form conventionally used in English. The example suggests that capitalising both syllables is preferred, but there is no clear statement either way on this issue.
And what about character names – does the change affect them at all?
I am really glad that IMDb is doing something about this, though my enthusiasm might not show from what I’ve written above, as it will eliminate all those “(as …)” attributes which only differ in name order being added and mean that cinemas which copy information from IMDb will be getting these names in the correct order. But such a drastic change in policy as this needs a detailed explanation of what it affects and does not to accompany it.
(edited)
9
RoberMiller
5 Messages
•
104 Points
4 years ago
Nice consideration. It will now be better and convenient for the Chinese/Asian members. Hope it won't affect any functionality in any way.
5
0
English_pedantic_grammarian
78 Messages
•
1.5K Points
4 years ago
After thinking about this for a while (I do mostly correct and add data on Chinese, Japanese and Korean content, so it has an enormous impact on my contributing), I'm wondering from how Giancarlo_Cairella's posts have worded things but not stated it explicitly, if IMDb is preparing to drop any division between family and personal names, storing each name as a single string of text which can only be sorted by the very first letter, as there's no indication in it of which part it should be sorted by?
This is how The Movie Database deals with names, and I can see the logic in it.
It breaks sorting by family name when the family name comes last, but the only part of IMDb that I use which alphabetical sorting majorly affects is the lists of credits in each staff section, which are currently ordered by the people's names. If these were ordered alphabetically by the name of the role, not the person, that would (a) make a lot more sense, in my opinion, and make them easier to read through and (b) mean what part of their names people are sorted by would be less important, as there are typically only a relative few people with exactly the same job name on a production.
Casts are automatically sorted alphabetically when too few of them have order numbers, but that in most cases is only a temporary measure until order numbers are added.
If that change happens, I'm sure many will be outraged. Adding in the function of custom display orders for names might be better, but removing the division might be easier to implement and would still be a big improvement over the current situation, in my opinion.
It would make it far easier to deal not only with different cultures' formats for names but also the names of bands like "Guns n' Roses" (IMDb would consider Roses the family name and sort it under R), people known by titles like "Queen Elizabeth II" (IMDb would consider II the family name and sort it under I), and people who use descriptive stage names like "Meat Loaf" (IMDb would consider Loaf the family name and sort it under L). We humans know that those interpretations are wrong, but non-AI programming cannot tell that.
The same happens to names of Korean people who go by a personal name with a space between the syllables, most famously BTS member Jung Kook. IMDb would consider Kook the family name and sort it under K. Which is like considering Lett to be Scarlett Johansson's family name and sorting her under L. I know that interpretation is wrong, but not only can unintelligent programming not tell that, people who don't know anything about Korean names can also not tell that.
Currently, these "special" names can only be made to be sorted correctly by contacting IMDb staff, which is a hassle for everyone involved. For a non-anglophone name like Jung Kook it requires understanding how names can be formatted in different cultures, which IMDb do not have staff that are experts in for all the cultures of the world, and that means people requesting the names to be corrected have to explain the cultural context each time.
Sorting all names by the very first letter of them is not traditional in most of Europe or America, but the great advantage of doing that over any other conceivable solution is it doesn't require specialist knowledge either on the part of IMDb or contributors.
I'm old enough to remember that IMDb used to require leading articles of English-language titles to go at the end, after a comma: the primary title of The Magnificent Seven would be displayed that way but stored as "Magnificent Seven, The". I suspect this practice was ended because (a) IMDb staff didn't have specialist knowledge of sorting conventions for all the world's languages (which can be complex: French traditionally treats titles beginning with definite and with indefinite articles differently) and (b) the system was getting confused by titles with commas in them, and the number of them needing to be specially corrected by staff was too much for the staff to handle.
That was a break from tradition which it appears everyone is now used to, as I see no one complaining about it and asking it to be retracted in these forums. Maybe simplifying how names are sorted may end up being considered just as normal after a similar amount of time?
0
Mvybuf
111 Messages
•
1.5K Points
3 years ago
I notice nobody mentioned Arabic names in particular... probably because they get so LONG, with 20 or more names in the string, including several REPEATED, particularly prefixes (or whatever you call it, like "von" or "de" in Germanic names, which can open up another naming problem like how do you alphabetize by family name if it's preceded by de or von or van - I often see the name under D for "de" or V for "von" or "van", not to mention that those prefixes are not capitalized usually...)
Where was I? This is Soooo confusing. Okay, so the Arabic prefix I see commonly is "AL" or "EL", but other names, could they be given? middle? family? are sometimes repeated in the full name.
The long Arabic names occur because the full name describes the person's ancestors and linked families. (Reminds me of the Spain convention, carried over into some early latino-american personages, of the family name always including both the father's and mother's (maiden) names separated by "y" (Spanish for "and"). Enrique Granados y Campiña, always listed without the name after the y (I can't remember which is father, which is mother, I think mother is first), so file under G. But what about those who are known more by the second name? Using western comma conventions, the family name given first could be the one AFTER the "y"...)
For another digression, there's the confusing (to me) Slavic practice of adding a suffix to a woman's name, like "a" or "ova" or "aya", sometimes even modifying the final letter to add an "a" BEFORE it, as well as a suffix after it (Tarkovsky -> Tarkovskaya) that in some societies means she's married, in others it's added at birth.
At least with Korean names, you can always tell which is the given name because it's the hyphenated one. Sorta like the first and middle names, hyphenated together. Then there are Chinese names with hyphenated family names, who act in Korean movies...
Even without a hyphen, though, how does one tell which name is family and which is given? Zhang Dong vs Dong Zhang. I can see previous editors have been confused because some of these are the same people, need to be merged.
Oh, and there doesn't even seem to be any common convention when it comes to gender. Zhang or Dong can be male or female. Maybe only the middle name will tell you: Young-jin Jo (actor), Young-ji Jo (actress).
Anyway, how would one figure out what part of a long Arabic name is the family name, when the full name actually contains SEVERAL family names? Check out Star Trek Deep Space Nine's Alexander Siddig https://www.imdb.com/name/nm0796502/bio credited originally as Siddig El Fadil, with "Siddig" technically (?) being his (or one of his) family name, later changing his credit to Alexander Siddig, by western conventions, even though "Alexander" doesn't even feature in his full name (he just made it up), which is (get ready) Siddig El Tahir El Fadil El Siddig Abderrahman Mohammed Ahmed Abdel Karim El Mahdi (I love it). And his nickname is Sid, so uh maybe Siddig is a given name not a family name... Or maybe it's BOTH?? It occurs once at the beginning, and again later with an EL prefix, so I dunno... Good thing he doesn't insist on being listed with his full name, or the IMDb computers would have the CPU equivalent of an electron heart attack.
Better not think about it too much. Just list names any old way. That seems to be the reasoning behind IMDb lifting the restrictions on Asian names. I agree though that the comma should now be considered obsolete, and the editing form shouldn't automatically reformat the name you entered without a comma by switching the names around and adding it.
However, it would be nice to know what a person's family name was. I feel a bit weird referring to Sion Sono (or Sono Sion) as just "Sono" or "Sion" if one of those is his given name (I can never remember which!); it's a bit too forward and disrespectful, especially considering Japanese cultural politeness.
Sometimes it's not confusing at all. For instance I've never seen Akira Kurosawa's name in conventional Asian format "Kurosawa Akira" looks just odd to me, and I'd never speak it that way, even if it's more correct.
(edited)
1
plur62
278 Messages
•
6K Points
1 year ago
I apologize for bringing up this topic, but actor Chow Yun-fat's family name is Chow, not Yun-fat as written in the opening post.
(because "Yun-Fat" is the family name)
1
plur62
278 Messages
•
6K Points
1 year ago
By the way, there are no rules for Korean names on the site, so there are thousands where the family name comes first (as it should be) and also thousands where the given name comes first, it's a complete mess.
I'm trying to fix it, but it's a lot of work, and my contributions are often declined for no reason.
(edited)
3