[go: nahoru, domu]

Skip Headers

Oracle® Database Globalization Support Guide
10g Release 1 (10.1)

Part Number B10749-01
Go to Documentation Home
Home
Go to Book List
Book List
Go to Table of Contents
Contents
Go to Index
Index
Go to Master Index
Master Index
Go to Feedback page
Feedback

Go to previous page
Previous
Go to next page
Next
View PDF

A
Locale Data

This appendix lists the languages, territories, character sets, and other locale data supported by the Oracle server. It includes these topics:

You can obtain information about character sets, languages, territories, and sorting orders by querying the V$NLS_VALID_VALUES dynamic performance view.

See Also:

Oracle Database Reference for more information about the data that can be returned by this view

Languages

Languages inTable A-1 provide support for locale-sensitive information such as the following:

By using Unicode databases and datatypes, you can store, process, and retrieve data for almost all contemporary languages, including many that do not appear in Table A-1.

Table A-1 Oracle Supported Languages  
Language Name Language Abbreviation Default Sort

AMERICAN

us

binary

ARABIC

ar

ARABIC

ASSAMESE

as

binary

AZERBAIJANI

az

AZERBAIJANI

BANGLA

bn

binary

BRAZILIAN PORTUGUESE

ptb

WEST_EUROPEAN

BULGARIAN

bg

BULGARIAN

CANADIAN FRENCH

frc

CANADIAN FRENCH

CATALAN

ca

CATALAN

CROATIAN

hr

CROATIAN

CYRILLIC KAZAKH

ckk

GENERIC_M

CYRILLIC SERBIAN

csr

GENERIC_M

CYRILLIC UZBEK

cuz

GENERIC_M

CZECH

cs

CZECH

DANISH

dk

DANISH

DUTCH

nl

DUTCH

EGYPTIAN

eg

ARABIC

ENGLISH

gb

binary

ESTONIAN

et

ESTONIAN

FINNISH

sf

FINNISH

FRENCH

f

FRENCH

GERMAN DIN

din

GERMAN

GERMAN

d

GERMAN

GREEK

el

GREEK

GUJARATI

gu

binary

HEBREW

iw

HEBREW

HINDI

hi

binary

HUNGARIAN

hu

HUNGARIAN

ICELANDIC

is

ICELANDIC

INDONESIAN

in

INDONESIAN

ITALIAN

i

WEST_EUROPEAN

JAPANESE

ja

binary

KANNADA

kn

binary

KOREAN

ko

binary

LATIN AMERICAN SPANISH

esa

SPANISH

LATIN SERBIAN

lsr

binary

LATIN UZBEK

luz

GENERIC_M

LATVIAN

lv

LATVIAN

LITHUANIAN

lt

LITHUANIAN

MACEDONIAN

mk

binary

MALAY

ms

MALAY

MALAYALAM

ml

binary

MARATHI

mr

binary

MEXICAN SPANISH

esm

WEST_EUROPEAN

NORWEGIAN

n

NORWEGIAN

ORIYA

or

binary

POLISH

pl

POLISH

PORTUGUESE

pt

WEST_EUROPEAN

PUNJABI

pa

binary

ROMANIAN

ro

ROMANIAN

RUSSIAN

ru

RUSSIAN

SIMPLIFIED CHINESE

zhs

binary

SLOVAK

sk

SLOVAK

SLOVENIAN

sl

SLOVENIAN

SPANISH

e

SPANISH

SWEDISH

s

SWEDISH

TAMIL

ta

binary

TELUGU

te

binary

THAI

th

THAI_DICTIONARY

TRADITIONAL CHINESE

zht

binary

TURKISH

tr

TURKISH

UKRAINIAN

uk

UKRAINIAN

VIETNAMESE

vn

VIETNAMESE

Translated Messages

Oracle error messages have been translated into the languages which are listed in Table A-2.

Table A-2 Oracle Supported Messages  
Name Abbreviation

ARABIC

ar

BRAZILIAN PORTUGUESE

ptb

CATALAN

ca

CZECH

cs

DANISH

dk

DUTCH

nl

FINNISH

sf

FRENCH

f

GERMAN

d

GREEK

el

HEBREW

iw

HUNGARIAN

hu

ITALIAN

i

JAPANESE

ja

KOREAN

ko

NORWEGIAN

n

POLISH

pl

PORTUGUESE

pt

ROMANIAN

ro

RUSSIAN

ru

SIMPLIFIED CHINESE

zhs

SLOVAK

sk

SPANISH

e

SWEDISH

s

THAI

th

TRADITIONAL CHINESE

zht

TURKISH

tr

Territories

Table A-3 lists the territories supported by the Oracle server.

Table A-3 Oracle Supported Territories  
Name Name Name

ALGERIA

HONG KONG

PERU

-

-

PHILIPPINES

AMERICA

HUNGARY

POLAND

AUSTRALIA

ICELAND

PORTUGAL

AUSTRIA

INDIA

PUERTO RICO

BAHRAIN

INDONESIA

QATAR

BANGLADESH

IRAQ

ROMANIA

-

-

RUSSIA

BELGIUM

IRELAND

SAUDI ARABIA

-

-

SERBIA AND MONTENEGRO

BRAZIL

ISRAEL

SINGAPORE

BULGARIA

ITALY

SLOVAKIA

CANADA

JAPAN

SLOVENIA

CATALONIA

JORDAN

SOMALIA

CHILE

KAZAKHSTAN

SOUTH AFRICA

CHINA

KOREA

SPAIN

ARGENTINA

-

-

AZERBAIJAN

KUWAIT

SUDAN

COLOMBIA

LATVIA

SWEDEN

COSTA RICA

LEBANON

SWITZERLAND

CROATIA

LIBYA

SYRIA

CYPRUS

LITHUANIA

TAIWAN

CZECH REPUBLIC

LUXEMBOURG

THAILAND

DENMARK

-

THE NETHERLANDS

DJIBOUTI

MALAYSIA

TUNISIA

ECUADOR

-

-

EGYPT

MAURITANIA

TURKEY

EL SALVADOR

MEXICO

UKRAINE

ESTONIA

MOROCCO

UNITED ARAB EMIRATES

FINLAND

NEW ZEALAND

UNITED KINGDOM

FRANCE

NICARAGUA

UZBEKISTAN

FYR MACEDONIA

-

VENEZUELA

GUATEMALA

NORWAY

VIETNAM

GERMANY

OMAN

YEMEN

GREECE

PANAMA

-

Character Sets

Oracle-supported character sets are listed in the following sections according to three broad language groups.

In addition, common subset/superset combinations are listed.

Note that some character sets may be listed under multiple language groups because they provide multilingual support. For instance, Unicode spans the Asian, European, and Middle Eastern language groups because it supports most of the major scripts of the world.

The comment section indicates the type of encoding used:

SB = Single-byte encoding
MB = Multibyte encoding
FIXED = Fixed-width multibyte encoding

As mentioned in Chapter 3, "Setting Up a Globalization Support Environment", the type of encoding affects performance, so use the most efficient encoding that meets your language needs. Also, some encoding types can only be used with certain data types. For instance, the AL16UTF16 character set can only be used as an NCHAR character set, and not as a database character set.

Also documented in the comment section are other unique features of the character set that may be important to users or your database administrator. For example, the information includes whether the character set supports the euro currency symbol, whether user-defined characters are supported, and whether the character set is a strict superset of ASCII. (You can use the ALTER DATABASE CHARACTER SET statement to migrate from the US7ASCII character set to a character set that is a strict superset of ASCII.)

The following is the key for the comment column of the character set tables:

EURO: euro symbol supported
UDC: user-defined characters supported
ASCII: strict superset of ASCII
MB: multibyte
SB: single-byte

Oracle does not document individual code page layouts. For specific details about a particular character set, its character repertoire, and code point values, you can use Oracle Locale Builder may be used. Otherwise, you should refer to the actual national, international, or vendor-specific standards.

See Also:

Chapter 13, "Customizing Locale"

Asian Language Character Sets

Table A-4 lists the Oracle character sets that can support Asian languages. The list is alphabetical by description.

Table A-4 Asian Language Character Sets  
Name Description Comments

BN8BSCII

Bangladesh National Code 8-bit BSCII

SB, ASCII

ZHT16BIG5

BIG5 16-bit Traditional Chinese

MB, ASCII

ZHS16CGB231280

CGB2312-80 16-bit Simplified Chinese

MB, ASCII

JA16EUC

EUC 24-bit Japanese

MB, ASCII

JA16EUCTILDE

The same as JA16EUC except for the way that the wave dash and the tilde are mapped to and from Unicode.

MB, ASCII

JA16EUCYEN

EUC 24-bit Japanese with '\' mapped to the Japanese yen character

MB

ZHT32EUC

EUC 32-bit Traditional Chinese

MB, ASCII

ZHS16GBK

GBK 16-bit Simplified Chinese

MB, ASCII, UDC

ZHS32GB18030

GB18030-2000

MB, ASCII, EURO

ZHT16CCDC

HP CCDC 16-bit Traditional Chinese

MB, ASCII

JA16DBCS

IBM EBCDIC 16-bit Japanese

MB, UDC

JA16EBCDIC930

IBM DBCS Code Page 290 16-bit Japanese

MB, UDC

KO16DBCS

IBM EBCDIC 16-bit Korean

MB, UDC

ZHS16DBCS

IBM EBCDIC 16-bit Simplified Chinese

MB, UDC

ZHT16DBCS

IBM EBCDIC 16-bit Traditional Chinese

MB, UDC

JA16VMS

JVMS 16-bit Japanese

MB, ASCII

KO16KSC5601

KSC5601 16-bit Korean

MB, ASCII

KO16KSCCS

KSCCS 16-bit Korean

MB, ASCII

ZHS16MACCGB231280

Mac client CGB2312-80 16-bit Simplified Chinese

MB

JA16MACSJIS

Mac client Shift-JIS 16-bit Japanese

MB

TH8MACTHAI

Mac Client 8-bit Latin/Thai

SB

TH8MACTHAIS

Mac Server 8-bit Latin/Thai

SB, ASCII

TH8TISEBCDICS

Thai Industrial Standard 620-2533-EBCDIC Server 8-bit

SB

KO16MSWIN949

MS Windows Code Page 949 Korean

MB, ASCII, UDC

ZHT16HKSCS

MS Windows Code Page 950 with Hong Kong Supplementary Character Set HKSCS-2001 (character set conversion to and from Unicode is based on Unicode 3.0)

MB, ASCII, EURO

ZHT16MSWIN950

MS Windows Code Page 950 Traditional Chinese

MB, ASCII, UDC

VN8MSWIN1258

MS Windows Code Page 1258 8-bit Vietnamese

SB, ASCII, EURO

IN8ISCII

Multiple-Script Indian Standard 8-bit Latin/Indian Languages

SB, ASCII

JA16SJIS

Shift-JIS 16-bit Japanese

MB, ASCII, UDC

JA16SJISTILDE

The same as JA16SJIS except for the way that the wave dash and the tilde are mapped to and from Unicode.

MB, ASCII, UDC

JA16SJISYEN

Shift-JIS 16-bit Japanese with '\' mapped to the Japanese yen character

MB, UDC

ZHT32SOPS

SOPS 32-bit Traditional Chinese

MB, ASCII

ZHT16DBT

Taiwan Taxation 16-bit Traditional Chinese

MB, ASCII

TH8TISASCII

Thai Industrial Standard 620-2533 - ASCII 8-bit

SB, ASCII, EURO

TH8TISEBCDIC

Thai Industrial Standard 620-2533 - EBCDIC 8-bit

SB

ZHT32TRIS

TRIS 32-bit Traditional Chinese

MB, ASCII

AL16UTF16

See "Universal Character Sets" for details

MB, EURO, FIXED

AL32UTF8

See "Universal Character Sets" for details

MB, ASCII, EURO

UTF8

See "Universal Character Sets" for details

MB, ASCII, EURO

UTFE

See "Universal Character Sets" for details

MB, EURO

VN8VN3

VN3 8-bit Vietnamese

SB, ASCII

European Language Character Sets

Table A-5 lists the Oracle character sets that can support European languages. The list is alphabetical by description.

Table A-5 European Language Character Sets  
Name Description Comments

US7ASCII

ASCII 7-bit American

SB, ASCII

SF7ASCII

ASCII 7-bit Finnish

SB

YUG7ASCII

ASCII 7-bit Yugoslavian

SB

RU8BESTA

BESTA 8-bit Latin/Cyrillic

SB, ASCII

EL8GCOS7

Bull EBCDIC GCOS7 8-bit Greek

SB

WE8GCOS7

Bull EBCDIC GCOS7 8-bit West European

SB

EL8DEC

DEC 8-bit Latin/Greek

SB

TR7DEC

DEC VT100 7-bit Turkish

SB

TR8DEC

DEC 8-bit Turkish

SB, ASCII

WE8DEC

DEC 8-bit West European

SB, ASCII

D7DEC

DEC VT100 7-bit German

SB

F7DEC

DEC VT100 7-bit French

SB

S7DEC

DEC VT100 7-bit Swedish

SB

E7DEC

DEC VT100 7-bit Spanish

SB

NDK7DEC

DEC VT100 7-bit Norwegian/Danish

SB

I7DEC

DEC VT100 7-bit Italian

SB

NL7DEC

DEC VT100 7-bit Dutch

SB

CH7DEC

DEC VT100 7-bit Swiss (German/French)

SB

SF7DEC

DEC VT100 7-bit Finnish

SB

WE8DG

DG 8-bit West European

SB, ASCII

WE8EBCDIC37C

EBCDIC Code Page 37 8-bit Oracle/c

SB

WE8EBCDIC37

EBCDIC Code Page 37 8-bit West European

SB

D8EBCDIC273

EBCDIC Code Page 273/1 8-bit Austrian German

SB

DK8EBCDIC277

EBCDIC Code Page 277/1 8-bit Danish

SB

S8EBCDIC278

EBCDIC Code Page 278/1 8-bit Swedish

SB

I8EBCDIC280

EBCDIC Code Page 280/1 8-bit Italian

SB

WE8EBCDIC284

EBCDIC Code Page 284 8-bit Latin American/Spanish

SB

WE8EBCDIC285

EBCDIC Code Page 285 8-bit West European

SB

WE8EBCDIC1047

EBCDIC Code Page 1047 8-bit West European

SB

WE8EBCDIC1140

EBCDIC Code Page 1140 8-bit West European

SB, EURO

WE8EBCDIC1140C

EBCDIC Code Page 1140 Client 8-bit West European

SB, EURO

WE8EBCDIC1145

EBCDIC Code Page 1145 8-bit West European

SB, EURO

WE8EBCDIC1146

EBCDIC Code Page 1146 8-bit West European

SB, EURO

WE8EBCDIC1148

EBCDIC Code Page 1148 8-bit West European

SB, EURO

WE8EBCDIC1148C

EBCDIC Code Page 1148 Client 8-bit West European

SB, EURO

F8EBCDIC297

EBCDIC Code Page 297 8-bit French

SB

WE8EBCDIC500C

EBCDIC Code Page 500 8-bit Oracle/c

SB

WE8EBCDIC500

EBCDIC Code Page 500 8-bit West European

SB

EE8EBCDIC870

EBCDIC Code Page 870 8-bit East European

SB

EE8EBCDIC870C

EBCDIC Code Page 870 Client 8-bit East European

SB

EE8EBCDIC870S

EBCDIC Code Page 870 Server 8-bit East European

SB

WE8EBCDIC871

EBCDIC Code Page 871 8-bit Icelandic

SB

EL8EBCDIC875

EBCDIC Code Page 875 8-bit Greek

SB

EL8EBCDIC875R

EBCDIC Code Page 875 Server 8-bit Greek

SB

CL8EBCDIC1025

EBCDIC Code Page 1025 8-bit Cyrillic

SB

CL8EBCDIC1025C

EBCDIC Code Page 1025 Client 8-bit Cyrillic

SB

CL8EBCDIC1025R

EBCDIC Code Page 1025 Server 8-bit Cyrillic

SB

CL8EBCDIC1025S

EBCDIC Code Page 1025 Server 8-bit Cyrillic

SB

CL8EBCDIC1025X

EBCDIC Code Page 1025 (Modified) 8-bit Cyrillic

SB

TR8EBCDIC1026

EBCDIC Code Page 1026 8-bit Turkish

SB

TR8EBCDIC1026S

EBCDIC Code Page 1026 Server 8-bit Turkish

SB

BLT8EBCDIC1112

EBCDIC Code Page 1112 8-bit Baltic Multilingual

SB

BLT8EBCDIC1112S

EBCDIC Code Page 1112 8-bit Server Baltic Multilingual

SB

D8EBCDIC1141

EBCDIC Code Page 1141 8-bit Austrian German

SB, EURO

DK8EBCDIC1142

EBCDIC Code Page 1142 8-bit Danish

SB, EURO

S8EBCDIC1143

EBCDIC Code Page 1143 8-bit Swedish

SB, EURO

I8EBCDIC1144

EBCDIC Code Page 1144 8-bit Italian

SB, EURO

F8EBCDIC1147

EBCDIC Code Page 1147 8-bit French

SB, EURO

CL8EBCDIC1158

EBCDIC Code Page 1158 8-bit Cyrillic

SB

CL8EBCDIC1158R

EBCDIC Code Page 1158 Server 8-bit Cyrillic

SB

EEC8EUROASCI

EEC Targon 35 ASCI West European/Greek

SB

EEC8EUROPA3

EEC EUROPA3 8-bit West European/Greek

SB

LA8PASSPORT

German Government Printer 8-bit All-European Latin

SB, ASCII

WE8HP

HP LaserJet 8-bit West European

SB

WE8ROMAN8

HP Roman8 8-bit West European

SB, ASCII

HU8CWI2

Hungarian 8-bit CWI-2

SB, ASCII

HU8ABMOD

Hungarian 8-bit Special AB Mod

SB, ASCII

EL8EBCDIC423R

IBM EBCDIC Code Page 423 for RDBMS server-side

SB

LV8RST104090

IBM-PC Alternative Code Page 8-bit Latvian (Latin/Cyrillic)

SB, ASCII

US8PC437

IBM-PC Code Page 437 8-bit American

SB, ASCII

BG8PC437S

IBM-PC Code Page 437 8-bit (Bulgarian Modification)

SB, ASCII

EL8PC437S

IBM-PC Code Page 437 8-bit (Greek modification)

SB, ASCII

EL8PC737

IBM-PC Code Page 737 8-bit Greek/Latin

SB

LT8PC772

IBM-PC Code Page 772 8-bit Lithuanian (Latin/Cyrillic)

SB, ASCII

LT8PC774

IBM-PC Code Page 774 8-bit Lithuanian (Latin)

SB, ASCII

BLT8PC775

IBM-PC Code Page 775 8-bit Baltic

SB, ASCII

WE8PC850

IBM-PC Code Page 850 8-bit West European

SB, ASCII

EL8PC851

IBM-PC Code Page 851 8-bit Greek/Latin

SB, ASCII

EE8PC852

IBM-PC Code Page 852 8-bit East European

SB, ASCII

RU8PC855

IBM-PC Code Page 855 8-bit Latin/Cyrillic

SB, ASCII

TR8PC857

IBM-PC Code Page 857 8-bit Turkish

SB, ASCII

WE8PC858

IBM-PC Code Page 858 8-bit West European

SB, ASCII, EURO

WE8PC860

IBM-PC Code Page 860 8-bit West European

SB. ASCII

IS8PC861

IBM-PC Code Page 861 8-bit Icelandic

SB, ASCII

CDN8PC863

IBM-PC Code Page 863 8-bit Canadian French

SB, ASCII

N8PC865

IBM-PC Code Page 865 8-bit Norwegian

SB. ASCII

RU8PC866

IBM-PC Code Page 866 8-bit Latin/Cyrillic

SB, ASCII

EL8PC869

IBM-PC Code Page 869 8-bit Greek/Latin

SB, ASCII

LV8PC1117

IBM-PC Code Page 1117 8-bit Latvian

SB, ASCII

US8ICL

ICL EBCDIC 8-bit American

SB

WE8ICL

ICL EBCDIC 8-bit West European

SB

WE8ISOICLUK

ICL special version ISO8859-1

SB

WE8ISO8859P1

ISO 8859-1 West European

SB, ASCII

EE8ISO8859P2

ISO 8859-2 East European

SB, ASCII

SE8ISO8859P3

ISO 8859-3 South European

SB, ASCII

NEE8ISO8859P4

ISO 8859-4 North and North-East European

SB, ASCII

CL8ISO8859P5

ISO 8859-5 Latin/Cyrillic

SB, ASCII

AR8ISO8859P6

ISO 8859-6 Latin/Arabic

SB, ASCII

EL8ISO8859P7

ISO 8859-7 Latin/Greek

SB, ASCII, EURO

IW8ISO8859P8

ISO 8859-8 Latin/Hebrew

SB, ASCII

NE8ISO8859P10

ISO 8859-10 North European

SB, ASCII

BLT8ISO8859P13

ISO 8859-13 Baltic

SB, ASCII

CEL8ISO8859P14

ISO 8859-13 Celtic

SB, ASCII

WE8ISO8859P15

ISO 8859-15 West European

SB, ASCII, EURO

LA8ISO6937

ISO 6937 8-bit Coded Character Set for Text Communication

SB, ASCII

CL8ISOIR111

ISOIR111 Cyrillic

SB

IW7IS960

Israeli Standard 960 7-bit Latin/Hebrew

SB

CL8KOI8U

KOI8 Ukrainian Cyrillic

SB

WE8EBCDIC1047E

Latin 1/Open Systems 1047

SB, EBCDIC, EURO

WE8EBCDIC924

Latin 9 EBCDIC 924

SB, EBCDIC

BLT8CP921

Latvian Standard LVS8-92(1) Windows/Unix 8-bit Baltic

SB, ASCII

LV8PC8LR

Latvian Version IBM-PC Code Page 866 8-bit Latin/Cyrillic

SB, ASCII

AR8ARABICMAC

Mac Client 8-bit Latin/Arabic

SB

EE8MACCE

Mac Client 8-bit Central European

SB

EE8MACCROATIAN

Mac Client 8-bit Croatian

SB

WE8MACROMAN8

Mac Client 8-bit Extended Roman8 West European

SB

EL8MACGREEK

Mac Client 8-bit Greek

SB

IS8MACICELANDIC

Mac Client 8-bit Icelandic

SB

CL8MACCYRILLIC

Mac Client 8-bit Latin/Cyrillic

SB

TR8MACTURKISH

Mac Client 8-bit Turkish

SB

TR8MACTURKISHS

Mac Server 8-bit Turkish

SB, ASCII

AR8ARABICMACS

Mac Server 8-bit Latin/Arabic

SB, ASCII

EE8MACCES

Mac Server 8-bit Central European

SB, ASCII

EE8MACCROATIANS

Mac Server 8-bit Croatian

SB, ASCII

WE8MACROMAN8S

Mac Server 8-bit Extended Roman8 West European

SB, ASCII

CL8MACCYRILLICS

Mac Server 8-bit Latin/Cyrillic

SB, ASCII

EL8MACGREEKS

Mac Server 8-bit Greek

SB, ASCII

IS8MACICELANDICS

Mac Server 8-bit Icelandic

SB

BG8MSWIN

MS Windows 8-bit Bulgarian Cyrillic

SB, ASCII

LT8MSWIN921

MS Windows Code Page 921 8-bit Lithuanian

SB, ASCII

ET8MSWIN923

MS Windows Code Page 923 8-bit Estonian

SB, ASCII

EE8MSWIN1250

MS Windows Code Page 1250 8-bit East European

SB, ASCII, EURO

CL8MSWIN1251

MS Windows Code Page 1251 8-bit Latin/Cyrillic

SB, ASCII, EURO

WE8MSWIN1252

MS Windows Code Page 1252 8-bit West European

SB, ASCII, EURO

EL8MSWIN1253

MS Windows Code Page 1253 8-bit Latin/Greek

SB, ASCII, EURO

TR8MSWIN1254

MS Windows Code Page 1254 8-bit Turkish

SB, ASCII, EURO

BLT8MSWIN1257

MS Windows Code Page 1257 8-bit Baltic

SB, ASCII, EURO

WE8NCR4970

NCR 4970 8-bit West European

SB, ASCII

WE8NEXTSTEP

NeXTSTEP PostScript 8-bit West European

SB, ASCII

CL8KOI8R

RELCOM Internet Standard 8-bit Latin/Cyrillic

SB, ASCII

US8BS2000

Siemens 9750-62 EBCDIC 8-bit American

SB

DK8BS2000

Siemens 9750-62 EBCDIC 8-bit Danish

SB

F8BS2000

Siemens 9750-62 EBCDIC 8-bit French

SB

D8BS2000

Siemens 9750-62 EBCDIC 8-bit German

SB

E8BS2000

Siemens 9750-62 EBCDIC 8-bit Spanish

SB

S8BS2000

Siemens 9750-62 EBCDIC 8-bit Swedish

SB

DK7SIEMENS9780X

Siemens 97801/97808 7-bit Danish

SB

F7SIEMENS9780X

Siemens 97801/97808 7-bit French

SB

D7SIEMENS9780X

Siemens 97801/97808 7-bit German

SB

I7SIEMENS9780X

Siemens 97801/97808 7-bit Italian

SB

N7SIEMENS9780X

Siemens 97801/97808 7-bit Norwegian

SB

E7SIEMENS9780X

Siemens 97801/97808 7-bit Spanish

SB

S7SIEMENS9780X

Siemens 97801/97808 7-bit Swedish

SB

EE8BS2000

Siemens EBCDIC.DF.04 8-bit East European

SB

WE8BS2000

Siemens EBCDIC.DF.04 8-bit West European

SB

WE8BS2000E

Siemens EBCDIC.DF.04 8-bit West European

SB, EURO

CL8BS2000

Siemens EBCDIC.EHC.LC 8-bit Cyrillic

SB

WE8BS2000L5

Siemens EBCDIC.DF.L5 8-bit West European/Turkish

SB

CE8BS2000

Siemens EBCDIC.DF.04 8-bit Celtic

SB

AL16UTF16

See "Universal Character Sets" for details

MB, EURO, FIXED

AL32UTF8

See "Universal Character Sets" for details

MB, ASCII, EURO

UTF8

See "Universal Character Sets" for details

MB, ASCII, EURO

UTFE

See "Universal Character Sets" for details

MB, EURO

Middle Eastern Language Character Sets

Table A-6 lists the Oracle character sets that can support Middle Eastern languages. The list is alphabetical by description.

Table A-6 Middle Eastern Character Sets  
Name Description Comments

AR8APTEC715

APTEC 715 Server 8-bit Latin/Arabic

SB, ASCII

AR8APTEC715T

APTEC 715 8-bit Latin/Arabic

SB

AR8ASMO708PLUS

ASMO 708 Plus 8-bit Latin/Arabic

SB, ASCII

AR8ASMO8X

ASMO Extended 708 8-bit Latin/Arabic

SB, ASCII

AR8ADOS710

Arabic MS-DOS 710 Server 8-bit Latin/Arabic

SB, ASCII

AR8ADOS710T

Arabic MS-DOS 710 8-bit Latin/Arabic

SB

AR8ADOS720

Arabic MS-DOS 720 Server 8-bit Latin/Arabic

SB, ASCII

AR8ADOS720T

Arabic MS-DOS 720 8-bit Latin/Arabic

SB

TR7DEC

DEC VT100 7-bit Turkish

SB

TR8DEC

DEC 8-bit Turkish

SB

WE8EBCDIC37C

EBCDIC Code Page 37 8-bit Oracle/c

SB

IW8EBCDIC424

EBCDIC Code Page 424 8-bit Latin/Hebrew

SB

IW8EBCDIC424S

EBCDIC Code Page 424 Server 8-bit Latin/Hebrew

SB

WE8EBCDIC500C

EBCDIC Code Page 500 8-bit Oracle/c

SB

IW8EBCDIC1086

EBCDIC Code Page 1086 8-bit Hebrew

SB

AR8EBCDIC420S

EBCDIC Code Page 420 Server 8-bit Latin/Arabic

SB

AR8EBCDICX

EBCDIC XBASIC Server 8-bit Latin/Arabic

SB

TR8EBCDIC1026

EBCDIC Code Page 1026 8-bit Turkish

SB

TR8EBCDIC1026S

EBCDIC Code Page 1026 Server 8-bit Turkish

SB

AR8HPARABIC8T

HP 8-bit Latin/Arabic

SB

TR8PC857

IBM-PC Code Page 857 8-bit Turkish

SB, ASCII

IW8PC1507

IBM-PC Code Page 1507/862 8-bit Latin/Hebrew

SB, ASCII

AR8ISO8859P6

ISO 8859-6 Latin/Arabic

SB, ASCII

IW8ISO8859P8

ISO 8859-8 Latin/Hebrew

SB, ASCII

AZ8ISO8859PE

ISO 8859-9 Latin Azerbaijani

SB, ASCII

WE8ISO8859P9

ISO 8859-9 West European & Turkish

SB, ASCII

LA8ISO6937

ISO 6937 8-bit Coded Character Set for Text Communication

SB, ASCII

IW7IS960

Israeli Standard 960 7-bit Latin/Hebrew

SB

IW8MACHEBREW

Mac Client 8-bit Hebrew

SB

AR8ARABICMAC

Mac Client 8-bit Latin/Arabic

SB

AR8ARABICMACT

Mac 8-bit Latin/Arabic

SB

TR8MACTURKISH

Mac Client 8-bit Turkish

SB

IW8MACHEBREWS

Mac Server 8-bit Hebrew

SB, ASCII

AR8ARABICMACS

Mac Server 8-bit Latin/Arabic

SB, ASCII

TR8MACTURKISHS

Mac Server 8-bit Turkish

SB, ASCII

TR8MSWIN1254

MS Windows Code Page 1254 8-bit Turkish

SB, ASCII, EURO

IW8MSWIN1255

MS Windows Code Page 1255 8-bit Latin/Hebrew

SB, ASCII, EURO

AR8MSWIN1256

MS Windows Code Page 1256 8-Bit Latin/Arabic

SB. ASCII, EURO

IN8ISCII

Multiple-Script Indian Standard 8-bit Latin/Indian Languages

SB

AR8MUSSAD768

Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic

SB, ASCII

AR8MUSSAD768T

Mussa'd Alarabi/2 768 8-bit Latin/Arabic

SB

AR8NAFITHA711

Nafitha Enhanced 711 Server 8-bit Latin/Arabic

SB, ASCII

AR8NAFITHA711T

Nafitha Enhanced 711 8-bit Latin/Arabic

SB

AR8NAFITHA721

Nafitha International 721 Server 8-bit Latin/Arabic

SB, ASCII

AR8NAFITHA721T

Nafitha International 721 8-bit Latin/Arabic

SB

AR8SAKHR706

SAKHR 706 Server 8-bit Latin/Arabic

SB, ASCII

AR8SAKHR707

SAKHR 707 Server 8-bit Latin/Arabic

SB, ASCII

AR8SAKHR707T

SAKHR 707 8-bit Latin/Arabic

SB

AR8XBASIC

XBASIC 8-bit Latin/Arabic

SB

WE8BS2000L5

Siemens EBCDIC.DF.04.L5 8-bit West European/Turkish

SB

AL16UTF16

See "Universal Character Sets" for details

MB, EURO, FIXED

AL32UTF8

See "Universal Character Sets" for details

MB, ASCII, EURO

UTF8

See "Universal Character Sets" for details

MB, ASCII, EURO

UTFE

See "Universal Character Sets" for details

MB, EURO

Universal Character Sets

Table A-7 lists the Oracle character sets that provide universal language support. They attempt to support all languages of the world, including, but not limited to, Asian, European, and Middle Eastern languages.

Table A-7 Universal Character Sets
Name Description Comments

AL16UTF16

Unicode 3.2 UTF-16 Universal character set

MB, EURO, FIXED

AL32UTF8

Unicode 3.2 UTF-8 Universal character set

MB, ASCII, EURO

UTF8

Unicode 3.0 UTF-8 Universal character set, CESU-8 compliant

MB, ASCII, EURO

UTFE

EBCDIC form of Unicode 3.0 UTF-8 Universal character set (UTF-EBCDIC)

MB, EURO


Note:

CESU-8 defines an encoding scheme for Unicode that is identical to UTF-8 except for its representation of supplementary characters. In CESU-8, supplementary characters are represented as six-byte sequences that result from the transformation of each UTF-16 surrogate code unit into an eight-bit form that is similar to the UTF-8 transformation, but without first converting the input surrogate pairs to a scalar value. See Unicode Technical Report #26.


See Also:

Chapter 6, "Supporting Multilingual Databases with Unicode"

Character Set Conversion Support

The following character set encodings are supported for conversion only. They cannot be used as the database or national character set:

AL16UTF16LE
ISO2022-CN
ISO2022-JP
ISO2022-KR
HZ-GB-2312

You can use these character sets as the source_char_set or dest_char_set in the CONVERT function.

See Also:

Subsets and Supersets

Table A-8 lists common subset/superset relationships.

Table A-8 Subset-Superset Pairs  
Subset Superset

AR8ADOS710

AR8ADOS710T

AR8ADOS720

AR8ADOS720T

AR8ADOS720T

AR8ADOS720

AR8APTEC715

AR8APTEC715T

AR8ARABICMACT

AR8ARABICMAC

AR8ISO8859P6

AR8ASMO708PLUS

AR8ISO8859P6

AR8ASMO8X

AR8MUSSAD768

AR8MUSSAD768T

AR8MUSSAD768T

AR8MUSSAD768

AR8NAFITHA711

AR8NAFITHA711T

AR8NAFITHA721

AR8NAFITHA721T

AR8SAKHR707

AR8SAKHR707T

AR8SAKHR707T

AR8SAKHR707

BLT8CP921

BLT8ISO8859P13

BLT8CP921

LT8MSWIN921

D7DEC

D7SIEMENS9780X

D7SIEMENS9780X

D7DEC

DK7SIEMENS9780X

N7SIEMENS9780X

I7DEC

I7SIEMENS9780X

I7SIEMENS9780X

IW8EBCDIC424

IW8EBCDIC424

IW8EBCDIC1086

KO16KSC5601

KO16MSWIN949

LT8MSWIN921

BLT8ISO8859P13

LT8MSWIN921

BLT8CP921

N7SIEMENS9780X

DK7SIEMENS9780X

US7ASCII

See Table A-9, "US7ASCII Supersets".

UTF8

AL32UTF8

WE8DEC

TR8DEC

WE8DEC

WE8NCR4970

WE8ISO8859P1

WE8MSWIN1252

WE8ISO8859P9

TR8MSWIN1254

WE8NCR4970

TR8DEC

WE8NCR4970

WE8DEC

WE8PC850

WE8PC858

ZHS16GBK

ZHS32GB18030

US7ASCII is a special case because so many other character sets are supersets of it. Table A-9 lists supersets for US7ASCII.

Table A-9 US7ASCII Supersets  
Supersets Supersets Supersets

-

EE8MACCES

NEE8ISO8859P4

AL32UTF8

EE8MACCROATIANS

RU8BESTA

AR8ADOS710

EE8MSWIN1250

RU8PC855

AR8ADOS710T

EE8PC852

RU8PC866

AR8ADOS720

EL8DEC

SE8ISO8859P3

AR8ADOS720T

EL8ISO8859P7

TH8MACTHAIS

AR8APTEC715

EL8MACGREEKS

TH8TISASCII

AR8APTEC715T

EL8MSWIN1253

TR8DEC

AR8ARABICMACS

EL8PC437S

TR8MACTURKISHS

AR8ASMO708PLUS

EL8PC851

TR8MSWIN1254

AR8ASMO8X

EL8PC869

TR8PC857

AR8HPARABIC8T

ET8MSWIN923

US8PC437

AR8ISO8859P6

HU8ABMOD

UTF8

AR8MSWIN1256

HU8CWI2

VN8MSWIN1258

AR8MUSSAD768

IN8ISCII

VN8VN3

AR8MUSSAD768T

IS8PC861

WE8DEC

AR8NAFITHA711

IW8ISO8859P8

WE8DG

AR8NAFITHA711T

IW8MACHEBREWS

WE8ISO8859P1

AR8NAFITHA721

IW8MSWIN1255

WE8ISO8859P15

AR8NAFITHA721T

IW8PC1507

WE8ISO8859P9

AR8SAKHR706

JA16EUC

WE8MACROMAN8S

AR8SAKHR707

JA16SJIS

WE8MSWIN1252

AR8SAKHR707T

-

WE8NCR4970

AZ8ISO8859PE

-

-

BG8MSWIN

-

WE8NEXTSTEP

BG8PC437S

JA16VMS

WE8PC850

BLT8CP921

KO16KSC5601

WE8PC858

BLT8ISO8859P13

KO16KSCCS

WE8PC860

BLT8MSWIN1257

KO16MSWIN949

WE8ROMAN8

BLT8PC775

-

ZHS16CGB231280

BN8BSCII

LA8ISO6937

ZHS16GBK

CDN8PC863

LA8PASSPORT

ZHT16BIG5

CEL8ISO8859P14

LT8MSWIN921

ZHT16CCDC

CL8ISO8859P5

LT8PC772

ZHT16DBT

CL8KOI8R

LT8PC774

ZHT16HKSCS

CL8KOI8U

LV8PC1117

ZHT16MSWIN950

CL8ISOIR111

LV8PC8LR

ZHT32EUC

CL8MACCYRILLICS

LV8RST104090

ZHT32SOPS

CL8MSWIN1251

N8PC865

ZHT32TRIS

EE8ISO8859P2

NE8ISO8859P10

ZHS32GB18030

Language and Character Set Detection Support

Table A-10 displays the languages and character sets that are supported by the language and character set detection in the Character Set Scanner utilities (CSSCAN and LCSSCAN) and the Globalization Development Kit (GDK).

Each language has several character sets that can be detected.

When the binary values for a language match two or more encodings that have a subset/superset relationship, the subset character set is returned. For example, if the language is German and all characters are 7-bit, then US7ASCII is returned instead of WE8MSWIN1252, WE8ISO8859P15, or WE8ISO8859P1.

When the character set is determined to be UTF-8, the Oracle character set UTF8 is returned by default unless 4-byte characters (supplementary characters) are detected within the text. If 4-byte characters are detected, then the character set is reported as AL32UTF8.

Table A-10 Languages and Character Sets Supported by CSSCAN, LCSSCAN, and GDK  
Language Character Sets

Arabic

AL16UTF16, AL32UTF8, AR8ISO8859P6, AR8MSWIN1256, UTF8

Bulgarian

AL16UTF16, AL32UTF8, CL8ISO8859P5, CL8MSWIN1251, UTF8

Catalan

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Croatian

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Czech

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Danish

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Dutch

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

English

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Estonian

AL16UTF16, AL32UTF8, NEE8IOS8859P4, UTF8

Finnish

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

French

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

German

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Greek

AL16UTF16, AL32UTF8, EL8ISO8859P7, EL8MSWIN1253, UTF8

Hebrew

AL16UTF16, AL32UTF8, IW8ISO8859P8, IW8MSWIN1255, UTF8

Hungarian

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Italian

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Japanese

AL16UTF16, AL32UTF8, ISO2022-JP, JA16EUC, JA16SJIS, UTF8

Korean

AL16UTF16, AL32UTF8, ISO2022-KR, KO16KSC5601, KO16MSWIN949, UTF8

Malay

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Norwegian

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Polish

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Portuguese

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Romanian

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Russian

AL16UTF16, AL32UTF8, CL8ISO8859P5, CL8KOI8R, CL8MSWIN1251, UTF8

Simplified Chinese

AL16UTF16, AL32UTF8, HZ-GB-2312, UTF8, ZHS16GBK, ZHS16CGB231280

Slovak

AL16UTF16, AL32UTF8, EE8ISO8859P2, EE8MSWIN1250, UTF8

Spanish

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Swedish

AL16UTF16, AL32UTF8, US7ASCII, UTF8, WE8ISO8859P1, WE8ISO8859P15, WE8MSWIN1252

Thai

AL16UTF16, AL32UTF8, TH8TISASCII, UTF8

Traditional Chinese

AL16UTF16, AL32UTF8, UTF8, ZHT16MSWIN950

Turkish

AL16UTF16, AL32UTF8, TR8MSWIN1254, UTF8, WE8ISO8859P9

Linguistic Sorts

Oracle offers two kinds of linguistic sorts, monolingual and multilingual. In addition, monolingual sorts can be extended to handle special cases. These special cases (represented with a prefix X) typically mean that the characters are sorted differently from their ASCII values. For example, ch and ll are treated as a single character in XSPANISH.

All of the linguistic sorts can be also be performed as case-insensitive or accent-insensitive by appending _CI or _AI to the linguistic sort name.

Table A-11 lists the monolingual linguistic sorts supported by the Oracle server.

See Also:

Table A-1, "Oracle Supported Languages" for a list of the default sort for each language

Table A-11 Monolingual Linguistic Sorts  
Basic Name Extended Name Special Cases

ARABIC

-

-

ARABIC_MATCH

-

-

ARABIC_ABJ_SORT

-

-

ARABIC_ABJ_MATCH

-

-

ASCII7

-

-

AZERBAIJANI

XAZERBAIJANI

i, I, lowercase i without dot, uppercase I with dot

BENGALI

-

-

BIG5

-

-

BINARY

-

-

BULGARIAN

-

-

CATALAN

XCATALAN

æ, AE, ß

CROATIAN

XCROATIAN

D, L, N, d, l, n, ß

CZECH

XCZECH

ch, CH, Ch, ß

CZECH_PUNCTUATION

XCZECH_PUNCTUATION

ch, CH, Ch, ß

DANISH

XDANISH

A, ß, Å, å

DUTCH

XDUTCH

ij, IJ

EBCDIC

-

-

EEC_EURO

-

-

EEC_EUROPA3

-

-

ESTONIAN

-

-

FINNISH

-

-

FRENCH

XFRENCH

-

GERMAN

XGERMAN

ß

GERMAN_DIN

XGERMAN_DIN

ß, ä, ö, ü, Ä, Ö, Ü

GBK

-

-

GREEK

-

-

HEBREW

-

-

HKSCS

-

-

HUNGARIAN

XHUNGARIAN

cs, gy, ny, sz, ty, zs, ß, CS, Cs, GY, Gy, NY, Ny, SZ, Sz, TY, Ty, ZS, Zs

ICELANDIC

-

-

INDONESIAN

-

-

ITALIAN

-

-

LATIN

-

-

LATVIAN

-

-

LITHUANIAN

-

-

MALAY

-

-

NORWEGIAN

-

-

POLISH

-

-

PUNCTUATION

XPUNCTUATION

-

ROMANIAN

-

-

RUSSIAN

-

-

SLOVAK

XSLOVAK

dz, DZ, Dz, ß (caron)

SLOVENIAN

XSLOVENIAN

ß

SPANISH

XSPANISH

ch, ll, CH, Ch, LL, Ll

SWEDISH

-

-

SWISS

XSWISS

ß

TURKISH

XTURKISH

æ, AE, ß

UKRAINIAN

-

-

UNICODE_BINARY

-

-

VIETNAMESE

-

-

WEST_EUROPEAN

XWEST_EUROPEAN

ß

Table A-12 lists the multilingual linguistic sorts available in Oracle. All of them include GENERIC_M (an ISO standard for sorting Latin-based characters) as a base. Multilingual linguistic sorts are used for a specific primary language together with Latin-based characters. For example, KOREAN_M sorts Korean and Latin-based characters, but it does not collate Chinese, Thai, or Japanese characters.

Table A-12 Multilingual LInguistic Sorts  
Sort Name Description

CANADIAN_M

Canadian French sort supports reverse secondary, special expanding characters

DANISH_M

Danish sort supports sorting lowercase characters before uppercase characters

FRENCH_M

French sort supports reverse sort for secondary

GENERIC_M

Generic sorting order which is based on ISO14651 and Unicode canonical equivalence rules but excluding compatible equivalence rules

JAPANESE_M

Japanese sort supports SJIS character set order and EUC characters which are not included in SJIS

KOREAN_M

Korean sort: Hangul characters are based on Unicode binary order. Hanja characters based on pronunciation order. All Hangul characters are before Hanja characters

SPANISH_M

Traditional Spanish sort supports special contracting characters

THAI_M

Thai sort supports swap characters for some vowels and consonants

SCHINESE_RADICAL_M

Simplified Chinese sort based on radical as primary order and number of strokes order as secondary order

SCHINESE_STROKE_M

Simplified Chinese sort uses number of strokes as primary order and radical as secondary order

SCHINESE_PINYIN_M

Simplified Chinese PinYin sorting order

TCHINESE_RADICAL_M

Traditional Chinese sort based on radical as primary order and number of strokes order as secondary order

TCHINESE_STROKE_M

Traditional Chinese sort uses number of strokes as primary order and radical as secondary order. It supports supplementary characters.

See Also:

Chapter 5, "Linguistic Sorting and String Searching"

Calendar Systems

By default, most territory definitions use the Gregorian calendar system. Table A-13 lists the other calendar systems supported by the Oracle server.

Table A-13 Supported Calendar Systems  
Name Default Date Format Character Set Used
For Default Date Format

Japanese Imperial

EEYYMMDD

JA16EUC

ROC Official

EEyymmdd

ZHT32EUC

Thai Buddha

dd month EE yyyy

TH8TISASCII

Persian

DD Month YYYY

AR8ASMO8X

Arabic Hijrah

DD Month YYYY

AR8ISO8859P6

English Hijrah

DD Month YYYY

AR8ISO8859P6

Figure A-1 shows how March 27, 1998 appears in Japanese Imperial.

Figure A-1 Japanese Imperial Example

Text description of nlspg005.gif follows.

Text description of the illustration nlspg005.gif

Time Zone Names

Table A-14 shows the time zone names from the large time zone file.

Table A-14 Time Zone Names  
Time Zone Name Is It in the Default Time Zone File? Time Zone Name Is It in the Default Time Zone File?

Africa/Algiers

No

Australia/Perth

Yes

Africa/Cairo

Yes

Australia/Queensland

Yes

Africa/Casablanca

No

Australia/South

Yes

Africa/Ceuta

No

Australia/Sydney

Yes

Africa/Djibouti

No

Australia/Tasmania

Yes

Africa/Freetown

No

Australia/Victoria

Yes

Africa/Johannesburg

No

Australia/West

Yes

Africa/Khartoum

No

Australia/Yancowinna

Yes

Africa/Mogadishu

No

Brazil/Acre

Yes

Africa/Nairobi

No

Brazil/DeNoronha

Yes

Africa/Nouakchott

No

Brazil/East

Yes

Africa/Tripoli

Yes

Brazil/West

Yes

Africa/Tunis

No

CET

Yes

Africa/Windhoek

No

CST

Yes

America/Adak

Yes

CST6CDT

Yes

America/Anchorage

Yes

Canada/Atlantic

Yes

America/Anguilla

No

Canada/Central

Yes

America/Araguaina

No

Canada/East-Saskatchewan

Yes

America/Aruba

No

Canada/Eastern

Yes

America/Asuncion

No

Canada/Mountain

Yes

America/Atka

Yes

Canada/Newfoundland

Yes

America/Belem

No

Canada/Pacific

Yes

America/Boa_Vista

No

Canada/Saskatchewan

Yes

America/Bogota

No

Canada/Yukon

Yes

America/Boise

No

Chile/Continental

Yes

America/Buenos_Aires

No

Chile/EasterIsland

Yes

America/Cambridge_Bay

No

Cuba

Yes

America/Cancun

No

EET

Yes

America/Caracas

No

EST

Yes

America/Cayenne

No

EST5EDT

Yes

America/Cayman

No

Egypt

Yes

America/Chicago

Yes

Eire

Yes

America/Chihuahua

No

Etc/GMT

Yes

America/Costa_Rica

No

Etc/GMT+0

Yes

America/Cuiaba

No

Etc/GMT+1

Yes

America/Curacao

No

Etc/GMT+10

Yes

America/Dawson

No

Etc/GMT+11

Yes

America/Dawson_Creek

No

Etc/GMT+12

Yes

America/Denver

Yes

Etc/GMT+2

Yes

America/Detroit

Yes

Etc/GMT+3

Yes

America/Edmonton

Yes

Etc/GMT+4

Yes

America/El_Salvador

No

Etc/GMT+5

Yes

America/Ensenada

Yes

Etc/GMT+6

Yes

America/Fort_Wayne

Yes

Etc/GMT+7

Yes

America/Fortaleza

No

Etc/GMT+8

Yes

America/Godthab

No

Etc/GMT+9

Yes

America/Goose_Bay

No

Etc/GMT-0

Yes

America/Grand_Turk

No

Etc/GMT-1

Yes

America/Guadeloupe

No

Etc/GMT-10

Yes

America/Guatemala

No

Etc/GMT-11

Yes

America/Guayaquil

No

-

-

America/Halifax

Yes

Etc/GMT-12

Yes

America/Havana

Yes

Etc/GMT-13

Yes

America/Indiana/Indianapolis

Yes

Etc/GMT-2

Yes

America/Indiana/Knox

No

Etc/GMT-3

Yes

America/Indiana/Marengo

No

Etc/GMT-4

Yes

America/Indiana/Vevay

No

Etc/GMT-5

Yes

America/Indianapolis

Yes

Etc/GMT-6

Yes

America/Inuvik

No

Etc/GMT-7

Yes

America/Iqaluit

No

Etc/GMT-8

Yes

America/Jamaica

Yes

Etc/GMT-9

Yes

America/Juneau

No

Etc/GMT0

Yes

America/Knox_IN

No

Etc/Greenwich

Yes

America/La_Paz

No

Europe/Amsterdam

No

America/Lima

No

Europe/Athens

No

America/Los_Angeles

Yes

Europe/Belfast

No

America/Louisville

No

Europe/Belgrade

No

America/Maceio

No

Europe/Berlin

No

America/Managua

No

Europe/Bratislava

No

America/Manaus

Yes

Europe/Brussels

No

America/Martinique

No

Europe/Bucharest

No

America/Mazatlan

Yes

Europe/Budapest

No

America/Mexico_City

Yes

Europe/Copenhagen

No

America/Miquelon

No

Europe/Dublin

Yes

America/Montevideo

No

Europe/Gibraltar

No

America/Montreal

Yes

Europe/Helsinki

No

America/Montserrat

No

Europe/Istanbul

Yes

America/New_York

Yes

Europe/Kaliningrad

No

America/Nome

No

Europe/Kiev

No

America/Noronha

Yes

Europe/Lisbon

Yes

America/Panama

No

Europe/Ljubljana

No

America/Phoenix

Yes

Europe/London

Yes

-

-

Europe/Luxembourg

No

America/Porto_Velho

No

Europe/Madrid

No

America/Puerto_Rico

No

Europe/Minsk

No

America/Rankin_Inlet

No

Europe/Monaco

No

America/Regina

Yes

Europe/Moscow

Yes

America/Rio_Branco

Yes

-

-

America/Santiago

Yes

Europe/Oslo

No

America/Sao_Paulo

Yes

Europe/Paris

No

America/Scoresbysund

No

Europe/Prague

No

America/Shiprock

Yes

Europe/Riga

No

America/St_Johns

Yes

Europe/Rome

No

America/St_Thomas

No

Europe/Samara

No

America/Swift_Current

No

Europe/San_Marino

No

America/Tegucigalpa

No

Europe/Sarajevo

No

America/Thule

No

Europe/Simferopol

No

America/Thunder_Bay

No

Europe/Skopje

No

America/Tijuana

Yes

Europe/Sofia

No

America/Tortola

No

Europe/Stockholm

No

America/Vancouver

Yes

Europe/Tallinn

No

America/Virgin

No

Europe/Tirane

No

America/Whitehorse

Yes

Europe/Vatican

No

America/Winnipeg

Yes

Europe/Vienna

No

America/Yellowknife

No

Europe/Vilnius

No

Arctic/Longyearbyen

No

Europe/Warsaw

Yes

Asia/Aden

No

Europe/Zagreb

No

Asia/Almaty

No

Europe/Zurich

No

Asia/Amman

No

GB

Yes

Asia/Anadyr

No

GB-Eire

Yes

Asia/Aqtau

No

GMT

Yes

Asia/Aqtobe

No

GMT+0

Yes

Asia/Baghdad

No

GMT-0

Yes

Asia/Bahrain

No

GMT0

Yes

Asia/Baku

No

Greenwich

Yes

Asia/Bangkok

No

HST

Yes

Asia/Beirut

No

Hongkong

Yes

Asia/Bishkek

No

Iceland

Yes

Asia/Calcutta

Yes

Indian/Chagos

No

Asia/Chungking

No

Indian/Christmas

No

Asia/Dacca

No

Indian/Cocos

No

Asia/Damascus

No

Indian/Mayotte

No

Asia/Dubai

No

Indian/Reunion

No

Asia/Gaza

No

Iran

Yes

Asia/Harbin

No

Israel

Yes

Asia/Hong_Kong

Yes

Jamaica

Yes

Asia/Irkutsk

No

Japan

Yes

Asia/Istanbul

Yes

Kwajalein

Yes

Asia/Jakarta

No

Libya

Yes

Asia/Jayapura

No

MET

Yes

Asia/Jerusalem

Yes

MST

Yes

Asia/Kabul

No

MST7MDT

Yes

Asia/Kamchatka

No

Mexico/BajaNorte

Yes

Asia/Karachi

No

Mexico/BajaSur

Yes

Asia/Kashgar

No

Mexico/General

Yes

Asia/Krasnoyarsk

No

NZ

Yes

Asia/Kuala_Lumpur

No

NZ-CHAT

Yes

Asia/Kuching

No

Navajo

Yes

Asia/Kuwait

No

PRC

Yes

Asia/Macao

No

PST

Yes

Asia/Magadan

No

PST8PDT

Yes

Asia/Manila

No

Pacific/Auckland

Yes

Asia/Muscat

No

Pacific/Chatham

Yes

Asia/Nicosia

No

Pacific/Easter

Yes

Asia/Novosibirsk

No

Pacific/Fakaofo

No

Asia/Omsk

No

Pacific/Fiji

No

Asia/Qatar

No

Pacific/Gambier

No

Asia/Rangoon

No

Pacific/Guam

No

Asia/Riyadh

Yes

Pacific/Honolulu

Yes

Asia/Saigon

No

Pacific/Johnston

No

Asia/Seoul

Yes

Pacific/Kiritimati

No

Asia/Shanghai

Yes

Pacific/Kwajalein

Yes

Asia/Singapore

Yes

Pacific/Marquesas

No

Asia/Taipei

Yes

Pacific/Midway

No

Asia/Tashkent

No

Pacific/Niue

No

Asia/Tbilisi

No

Pacific/Norfolk

No

Asia/Tehran

Yes

Pacific/Noumea

No

Asia/Tel_Aviv

Yes

Pacific/Pago_Pago

Yes

Asia/Tokyo

Yes

Pacific/Pitcairn

No

Asia/Ujung_Pandang

No

Pacific/Rarotonga

No

Asia/Urumqi

No

Pacific/Saipan

No

Asia/Vladivostok

No

Pacific/Samoa

Yes

Asia/Yakutsk

No

Pacific/Tahiti

No

Asia/Yekaterinburg

No

Pacific/Tongatapu

No

Asia/Yerevan

No

Pacific/Wake

No

Atlantic/Azores

No

Pacific/Wallis

No

Atlantic/Bermuda

No

Poland

Yes

Atlantic/Canary

No

Portugal

Yes

Atlantic/Faeroe

No

ROC

Yes

Atlantic/Madeira

No

ROK

Yes

Atlantic/Reykjavik

Yes

Singapore

Yes

Atlantic/St_Helena

No

Turkey

Yes

Atlantic/Stanley

No

US/Alaska

Yes

Australia/ACT

Yes

US/Aleutian

Yes

Australia/Adelaide

Yes

US/Arizona

Yes

Australia/Brisbane

Yes

US/Central

Yes

Australia/Broken_Hill

Yes

US/East-Indiana

Yes

Australia/Canberra

Yes

US/Eastern

Yes

Australia/Darwin

Yes

US/Hawaii

Yes

Australia/Hobart

Yes

US/Indiana-Starke

No

Australia/LHI

Yes

US/Michigan

Yes

Australia/Lindeman

Yes

US/Mountain

Yes

Australia/Lord_Howe

Yes

US/Pacific

Yes

Australia/Melbourne

Yes

US/Pacific-New

Yes

Australia/NSW

Yes

US/Samoa

Yes

Australia/North

Yes

UTC

No

-

-

W-SU

Yes

-

-

WET

Yes

See Also:

"Choosing a Time Zone File"

Obsolete Locale Data

This section contains information about obsolete linguistic sorts, character sets, languages, and territories.

Updates to the Oracle Language and Territory Definition Files

Changes have been made to the content in some of the language and territory definition files in Oracle Database 10g. These updates are necessary to correct the legacy definitions which no longer meet the local conventions in some of the Oracle supported languages and territories. These changes include modifications to the currency symbols, month names, and group separators. One example is the local currency symbol for Brazil. This has been updated from Cr$ to R$ in Oracle Database 10g.

Please refer to the "Oracle Language and Territory definition changes" table documented in the $ORACLE_HOME/nls/data/old/data_changes.html file for a detailed list of the changes.

Oracle Database 10g customers should review their existing application code and to make sure that the correct cultural conventions that are defined in Oracle Database 10g are being used. For customers who may not be able to make the necessary code changes to support their applications, Oracle offers Oracle9i backward compatibility by shipping a set of Oracle9i locale definition files with Oracle Database 10g. Please refer to the README.txt file in the $ORACLE_HOME/nls/data/old directory for more information.

Oracle Corporation strongly recommends that customers use the Oracle Database 10g locale definition files; Oracle9i locale definition files will be desupported in a future release.

Obsolete Linguistic Sorts

Table A-15 contains linguistic sorts that have been desupported in Oracle Database 10g.

Table A-15 Obsolete Linguistic Sorts in Oracle Database 10g  
Obsolete Sort Name Replacement Sort

THAI_TELEPHONE

THAI_M

THAI_DICTIONARY

THAI_M

CANADIAN_FRENCH

CANADIAN_M

JAPANESE

JAPANESE_M

CIS Is No Longer the Default Territory When the Language is RUSSIAN

RUSSIA has been added as a territory in OracleDatabase 10g. It is the default territory when the language is RUSSIAN. The CIS territory, which was the default territory in previous releases when the language was RUSSIAN, is supported for backward compatibility.

YUGOSLAVIA Is a Deprecated Territory

YUGOSLAVIA is a deprecated territory in Oracle Database 10g. Use SERBIA AND MONTENEGRO instead.

New Names for Obsolete Character Sets

Before Oracle release 7.2, when a character set was renamed, the old name was usually supported along with the new name for several releases after the change. Beginning with release 7.2, the old names are no longer supported.

Table A-16 lists the affected character sets. If you reference any of these character sets in your code, then replace them with their new name.

Table A-16 New Names for Obsolete Character Sets  
Old Name New Name

AL24UTFSS

UTF8, AL32UTF8

AR8MSAWIN

AR8MSWIN1256

CL8EBCDIC875S

CL8EBCDIC875R

EL8EBCDIC875S

EL8EBCDIC875R

JVMS

JA16VMS

JEUC

JA16EUC

SJIS

JA16SJIS

JDBCS

JA16DBCS

KSC5601

KO16KSC5601

KDBCS

KO16DBCS

CGB2312-80

ZHS16CGB231280

CNS 11643-86

ZHT32EUC

JA16EUCFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHS32EUCFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHS16GBKFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

JA16DBCSFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

KO16DBCSFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHS16DBCSFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHS16CGB231280FIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHT16DBCSFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

KO16KSC5601FIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

JA16SJISFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHT16BIG5FIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

ZHT32TRISFIXED

None. Replaced by new national character set. UTF8 and AL16UTF16.

Character set CL8MSWINDOW31 has been desupported. The newer character set CL8MSWIN1251 is actually a duplicate of CL8MSWINDOW31 and includes some characters omitted from the earlier version. Change any usage of CL8MSWINDOW31 to CL8MSWIN1251 instead.

AL24UTFFSS Character Set Desupported

The Unicode Character Set AL24UTFFSS was desupported in Oracle9i. AL24UTFFSS was introduced in version 7 as the Unicode character set supporting UTF-8 encoding scheme based on the Unicode standard 1.1, which is now obsolete. In Oracle Database 10g, Oracle offers the Unicode database character set AL32UTF8, which is based on Unicode 3.2, and UTF8, which is based on Unicode 3.0.

The migration path for an existing AL24UTFFSS database is to upgrade to UTF8 prior to upgrading to Oracle9i. As with all migrations to a new database character set, Oracle Corporation recommends that you use the Character Set Scanner for data analysis before attempting to migrate your existing database character set to UTF8.

See Also:

Chapter 12, "Character Set Scanner Utilities"

Bengali Language Definition Deprecated

Oracle's Bengali language definition is not compatible with Unicode standards. Oracle recommends that customers use the Bangla language definition instead. Bangla was introduced in Oracle9i Database Release 1 (9.0.1).

The Bengali language definition is supported in Oracle Database 10g Release 1 (10.1), but it may be desupported in a future release.

Czechoslovakia Territory Definition Deprecated

Oracle recommends that customers use either Czech Republic or Slovakia territory definitions in Oracle Database 10g Release 1 (10.1). The Czechoslovakia territory definition is supported in Oracle Database 10g Release 1 (10.1), but it may be desupported in a future release.