Soluling home   Document home

Language Id Format

Language Id Format

Use this dialog to specify what kind of language ids are used when writing files.

Language id format

Specifies how to format language ids used in xml:lang attributes. Possible values are:

Value Description
Standard: la[-Ssss][-CC] Standard language id that contains an ISO 639-1 language id (la) and optional script id (Ssss) and/or ISO-3166 country id (CC).
Legacy: la[-CC] Legacy language id that contains an ISO 639-1 language id (la) and an optional ISO-3166 country id (CC). If the id uses the secondary script of the language (e.g., Traditional Chinese), then the country id is required.
Underline: ll[_CC] As above, but separator character is underline, "_", instead of the hyphen, "-".

It is recommended to use the default language ids (either Use the same format as in the original file or Standard). Use other formats only if your files are consumed by a system that requires legacy ids.

Language id case

Specifies the case of the language attributes that are used in the file. Possible values are:

Value Description
Standard (la-CC) The language part is written in lower case. The optional country part is written in the upper case. For example, en-US is for English in the United States. This is the default value.
Lower (la-cc) Both language and country parts are written in lower case. For example en-us
Upper (LL-CC) Both language and country parts are written in upper case. For example EN-US

Include script into ids

Specifies when the script part is included in the language id. This is visible when language id format is set to Standard. Possible values are:

Value Description
Standard Script id is included if it is absolutely needed to make id unambiguous. For example, English is en, Simplified Chinese is zh, and Traditional Chinese is zh-Hant.
On multiple scripts Script id is included if the language can use more than one script. For example, English is en, Simplified Chinese is zh-Hans, and Traditional Chinese is zh-Hant.
Always Script id is always included. For example, English is en-Latn, Simplified Chinese is zh-Hans, and Traditional Chinese is zh-Hant.

Include country into multi-script ids

Specifies when the script part is included in the language id. This is visible when language id format is set to Legacy or Underline. Possible values are:

Value Description
No A country id is not added if not implicational given. For example, English is en, Simplified Chinese is zh, and Traditional Chinese is zh-TW.
Yes A country id is included if the language has more than one possible script. For example, English is en, Simplified Chinese is zh-CN, and Traditional Chinese is zh-TW.