gromit82's profile
Champion

Champion

 • 

7.4K Messages

 • 

276K Points

Wednesday, April 20th, 2022 3:27 AM

No Status

6

Unicode - 25-year update

IMDb used to have a newsletter, and the mid-April 1997 issue contained an item titled "The Great ISO Swap" reporting that IMDb had implemented the ISO 8859-1 (also known as ISO Latin-1) character set, allowing names and titles to use all the common letters with diacritical marks of the major Western European languages (such as å, ç, é, ï, ñ, ô, and ù).

http://web.archive.org/web/20060101140203/http://www.imdb.com/Newsletter/newsletter-13#iso

Near the end of the item the following statement appeared:

Ideally all data should be presented using its native character sets/ pictograms. Technically this is not possible though with current widespread software for web access, e-mail and operating systems in general.

In the future there will be a new huge standardized 16 bit character set called Unicode. It will offer the capability to freely combine Japanese Kanji with ISO 1 text and Hindi, for example. We will use it as it becomes widely available and supported by the industry.

I note that some additional character sets have been made available for the Alternate Titles section over the last few years (among them Greek, Chinese, Japanese, Korean, and Cyrillic), and I personally am not that affected by the lack of full Unicode support. However, I know that some contributors here would like to see further progress made in terms of implementing Unicode, so I am bringing this up to mark the 25 years since IMDb announced plans to implement it.

8 Messages

 • 

180 Points

3 years ago

When entire world is moving away from proprietary fonts towards will Unicode, IMDb still strictly bans the use of Unicode characters

What is the reason? How does the use of Unicode character harms the website?

When will Unicode character be allowed to be used at the site?

Thanks.

Note: This comment was created from a merged conversation originally titled When will Unicode character be allowed?

1 Message

 • 

60 Points

3 years ago

A good example of the importance of full Unicode support:

The Romanian film "Față în Față" ("Face to Face") becomes "Fata în Fata" on IMDb - "The Girl in the Girl" ! The is also how "Face/Off" is known in Romania, apparently...

8 Messages

 • 

180 Points

3 years ago

Without Unicode we cannot even use Latin alphabet, and entire astronomy has major use of that. Most of stars have Latin alphabet in their names, so we cannot properly mention the star without Unicode, have to write full name of that Latin letter. There are so many astronomy related movies that would need mention of Latin alphabet.