BERSERK's profile

20 Messages

458 Points

Saturday, October 26th, 2024 11:09 PM

Solved

Letters

I edited many titles and found that letters like Ö, Ç, and Ü are accepted, but letters like Ş and Ğ are not. What prevented adding all the letters?

I don't mean adding different languages, just similar letters.

  • S-Ș
  • G-Ğ
  • Ayse-Ayșe

In the end, an English speaker can understand it because it does not change the literal meaning of the word, and at the same time we maintain the official language when transferring the names of characters or the titles of episodes.

Accepted Solution

Champion

7.4K Messages

276K Points

27 days ago

Berserk: See https://web.archive.org/web/20060101140203/http://www.imdb.com/Newsletter/newsletter-13#iso for an explanation. That IMDb newsletter is actually from 1997, yet some of it holds true today.

Basically, in 1997, IMDb upgraded its capabilities to allow all of the accented letters in the ISO-Latin-1 character set to be used; most accented letters hadn't been available before. (See https://en.wikipedia.org/wiki/ISO/IEC_8859-1 for more about this character set.) ISO-Latin-1 covers substantially all the accented letters in the major Western European languages (and, incidentally, a few major non-European languages such as Indonesian and Tagalog). But it doesn't cover all the accented letters in languages such as Polish, Czech, or Turkish.
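To illustrate why some of your letters work and others don't, here is a small Python sketch that checks whether a character exists in ISO-Latin-1. The function name `in_latin1` is made up for this example, and this says nothing about how IMDb's own validation is implemented; it only shows which letters the character set contains.

```python
def in_latin1(ch: str) -> bool:
    """Return True if the character can be encoded in ISO-8859-1 (ISO-Latin-1)."""
    try:
        ch.encode("latin-1")
        return True
    except UnicodeEncodeError:
        return False


# The letters from the question, written as escapes so the example
# does not depend on how this page itself is encoded.
letters = {
    "O with diaeresis": "\u00d6",  # Ö
    "C with cedilla": "\u00c7",    # Ç
    "U with diaeresis": "\u00dc",  # Ü
    "S with cedilla": "\u015e",    # Ş
    "G with breve": "\u011e",      # Ğ
}

for name, ch in letters.items():
    status = "in ISO-Latin-1" if in_latin1(ch) else "not in ISO-Latin-1"
    print(f"{ch} ({name}): {status}")
```

The first three letters are reported as present and the last two as absent, which matches what you saw when editing titles.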

The newsletter linked above says, "In the future there will be a new huge standardized 16 bit character set called Unicode. ... We will use it as it becomes widely available and supported by the industry." Unicode would allow data from basically any standard language to be input into IMDb. While IMDb has, over the years, phased in some other character sets for certain uses, it has not fully implemented Unicode yet. I don't know when that will happen, and it might take decades to be accomplished.

Also as stated in the newsletter linked above, regarding the characters Ş and Ğ that you wanted to use:

"Galician, Maltese, Turkish and other languages with native character set ISO Latin 3:

Data must be transliterated to ISO Latin 1. The mappings are straightforward. If an accented character is missing in ISO Latin 1 use the non accented version. There is no server support for direct ISO Latin 3 data for the time being."
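As a rough illustration of the rule quoted from the newsletter (keep a letter if ISO Latin 1 has it, otherwise use the unaccented version), here is a minimal Python sketch. The function name `to_latin1` and the `?` fallback are my own assumptions for the example; this is not IMDb's actual conversion process.

```python
import unicodedata


def to_latin1(text: str) -> str:
    """Transliterate text so every character fits in ISO-8859-1.

    Hypothetical helper: characters already in ISO Latin 1 are kept,
    others are replaced by their unaccented base letter when one exists.
    """
    # Compose first, so "O" + combining diaeresis becomes a single letter.
    text = unicodedata.normalize("NFC", text)
    out = []
    for ch in text:
        try:
            ch.encode("latin-1")
            out.append(ch)  # already representable, keep as-is
            continue
        except UnicodeEncodeError:
            pass
        # Decompose and drop the combining accent marks.
        base = "".join(
            c for c in unicodedata.normalize("NFD", ch)
            if not unicodedata.combining(c)
        )
        try:
            base.encode("latin-1")
            out.append(base)
        except UnicodeEncodeError:
            out.append("?")  # assumption: no unaccented form is available
    return "".join(out)


print(to_latin1("Ay\u015fe"))       # Ayşe -> Ayse
print(to_latin1("\u011e\u00d6"))    # ĞÖ -> GÖ (Ö is already in ISO Latin 1)
```

Note that letters with no accent to strip, such as the Turkish dotless ı, have no decomposition, so this sketch falls back to `?` for them; a real mapping would need an explicit table for those cases.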

I hope this helps.

20 Messages

458 Points

Thank you🖤

20 Messages

458 Points

@owenrees Thank you🖤