Added in API level 24

UCharacter

public final class UCharacter
extends Object implements UCharacterEnums.ECharacterCategory, UCharacterEnums.ECharacterDirection

java.lang.Object
↳	android.icu.lang.UCharacter

[icu enhancement] ICU's replacement for Character. Methods, fields, and other functionality specific to ICU are labeled '[icu]'.

The UCharacter class provides extensions to the Character class. These extensions provide support for more Unicode properties. Each ICU release supports the latest version of Unicode available at that time.

For some time before Java 5 added support for supplementary Unicode code points, the ICU UCharacter class and many other ICU classes already supported them. Some UCharacter methods and constants were widened slightly differently than how the Character class methods and constants were widened later. In particular, Character.MAX_VALUE is still a char with the value U+FFFF, while UCharacter.MAX_VALUE is an int with the value U+10FFFF.

Code points are represented in these APIs using ints. While it would be more convenient in Java to have a separate primitive datatype for them, ints suffice in the meantime.
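The int-based code point model can be sketched with java.lang.Character, whose code point methods this page lists as the equivalents of the same-named UCharacter methods. The class name CodePointWalk is hypothetical, and the sketch assumes a plain JDK rather than an Android runtime.

```java
final class CodePointWalk {
    /** Counts code points by walking the UTF-16 string one code point at a time. */
    static int countCodePoints(String s) {
        int count = 0;
        for (int i = 0; i < s.length(); ) {
            int cp = Character.codePointAt(s, i); // a code point is an int, not a char
            i += Character.charCount(cp);         // 1 char for BMP, 2 for supplementary
            count++;
        }
        return count;
    }

    public static void main(String[] args) {
        // 'a', then U+1F600 written as a surrogate pair, then 'b'
        String s = "a\uD83D\uDE00b";
        System.out.println(s.length());                 // UTF-16 code units: 4
        System.out.println(countCodePoints(s));         // code points: 3
        System.out.println((int) Character.MAX_VALUE);  // char-typed limit, U+FFFF
        System.out.println(Character.MAX_CODE_POINT);   // int-typed limit, U+10FFFF
    }
}
```

On API level 24 and later, the same loop could presumably be written against UCharacter.codePointAt and UCharacter.charCount instead.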

Aside from the additions for UTF-16 support, and the updated Unicode properties, the main differences between UCharacter and Character are:

UCharacter is not designed to be a char wrapper and does not have APIs that involve management of that single char.
These include:
- char charValue(),
- int compareTo(java.lang.Character, java.lang.Character), etc.
UCharacter does not include Character APIs that are deprecated, nor does it include the Java-specific character information, such as boolean isJavaIdentifierPart(char ch).
Character maps characters 'A' - 'Z' and 'a' - 'z' to the numeric values '10' - '35'. UCharacter also does this in digit and getNumericValue, to adhere to the Java semantics of these methods. New methods unicodeDigit and getUnicodeNumericValue do not treat the above code points as having numeric values. This is a semantic change from ICU4J 1.3.1.
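The Java letter-to-value mapping described above can be observed directly with java.lang.Character. This is a minimal sketch that assumes a plain JDK; LetterDigits and valueIn are hypothetical names, and the ICU-only methods (getUnicodeNumericValue and friends) are not exercised here.

```java
final class LetterDigits {
    /** Value of a code point as a digit in the given radix, or -1 if it is not one. */
    static int valueIn(int codePoint, int radix) {
        return Character.digit(codePoint, radix);
    }

    public static void main(String[] args) {
        System.out.println(valueIn('7', 10));                  // 7
        System.out.println(valueIn('A', 16));                  // 10: letter used as a hex digit
        System.out.println(valueIn('z', Character.MAX_RADIX)); // 35 in radix 36
        System.out.println(valueIn('A', 10));                  // -1: not a digit in radix 10
        System.out.println(Character.getNumericValue('a'));    // 10 under Java semantics
    }
}
```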

In addition to Java compatibility functions, which calculate derived properties, this API provides low-level access to the Unicode Character Database.

Unicode assigns each code point (not just assigned character) values for many properties. Most of them are simple boolean flags, or constants from a small enumerated list. For some properties, values are strings or other relatively more complex types.

For more information see "About the Unicode Character Database" (http://www.unicode.org/ucd/) and the ICU User Guide chapter on Properties (http://www.icu-project.org/userguide/properties.html).

There are also functions that provide easy migration from C/POSIX functions like isblank(). Their use is generally discouraged because the C/POSIX standards do not define their semantics beyond the ASCII range, which means that different implementations exhibit very different behavior. Instead, Unicode properties should be used directly.

There are also only a few, broad C/POSIX character classes, and they tend to be used for conflicting purposes. For example, the "isalpha()" class is sometimes used to determine word boundaries, while a more sophisticated approach would at least distinguish initial letters from continuation characters (the latter including combining marks). (In ICU, BreakIterator is the most sophisticated API for word boundaries.) Another example: There is no "istitle()" class for titlecase characters.

ICU 3.4 and later provides API access for all twelve C/POSIX character classes. ICU implements them according to the Standard Recommendations in Annex C: Compatibility Properties of UTS #18 Unicode Regular Expressions (http://www.unicode.org/reports/tr18/#Compatibility_Properties).

API access for C/POSIX character classes is as follows:

- alpha:     isUAlphabetic(c) or hasBinaryProperty(c, UProperty.ALPHABETIC)
 - lower:     isULowercase(c) or hasBinaryProperty(c, UProperty.LOWERCASE)
 - upper:     isUUppercase(c) or hasBinaryProperty(c, UProperty.UPPERCASE)
 - punct:     ((1<<getType(c)) & ((1<<DASH_PUNCTUATION)|(1<<START_PUNCTUATION)|
               (1<<END_PUNCTUATION)|(1<<CONNECTOR_PUNCTUATION)|(1<<OTHER_PUNCTUATION)|
               (1<<INITIAL_PUNCTUATION)|(1<<FINAL_PUNCTUATION)))!=0
 - digit:     isDigit(c) or getType(c)==DECIMAL_DIGIT_NUMBER
 - xdigit:    hasBinaryProperty(c, UProperty.POSIX_XDIGIT)
 - alnum:     hasBinaryProperty(c, UProperty.POSIX_ALNUM)
 - space:     isUWhiteSpace(c) or hasBinaryProperty(c, UProperty.WHITE_SPACE)
 - blank:     hasBinaryProperty(c, UProperty.POSIX_BLANK)
 - cntrl:     getType(c)==CONTROL
 - graph:     hasBinaryProperty(c, UProperty.POSIX_GRAPH)
 - print:     hasBinaryProperty(c, UProperty.POSIX_PRINT)
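The punct expression in the list above is a general-category bit mask test. The following sketch shows the same technique with java.lang.Character so that it runs on a plain JDK. Note the assumptions: Character names the last two categories INITIAL_QUOTE_PUNCTUATION and FINAL_QUOTE_PUNCTUATION, its numeric category values are not guaranteed to match ICU's, and PosixPunct is a hypothetical class name.

```java
final class PosixPunct {
    // One bit per punctuation general category (Pd, Ps, Pe, Pc, Po, Pi, Pf).
    private static final int PUNCT_MASK =
            (1 << Character.DASH_PUNCTUATION)
          | (1 << Character.START_PUNCTUATION)
          | (1 << Character.END_PUNCTUATION)
          | (1 << Character.CONNECTOR_PUNCTUATION)
          | (1 << Character.OTHER_PUNCTUATION)
          | (1 << Character.INITIAL_QUOTE_PUNCTUATION)
          | (1 << Character.FINAL_QUOTE_PUNCTUATION);

    /** True if the code point's general category is one of the punctuation categories. */
    static boolean isPunct(int codePoint) {
        return ((1 << Character.getType(codePoint)) & PUNCT_MASK) != 0;
    }

    public static void main(String[] args) {
        System.out.println(isPunct('!'));    // true: other punctuation
        System.out.println(isPunct('('));    // true: start punctuation
        System.out.println(isPunct(0x201C)); // true: initial quote punctuation
        System.out.println(isPunct('$'));    // false: currency symbol
        System.out.println(isPunct('a'));    // false: letter
    }
}
```

Shifting 1 by the category value and testing against a precomputed mask checks membership in several categories with a single comparison.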

The C/POSIX character classes are also available in UnicodeSet patterns, using patterns like [:graph:] or \p{graph}.

[icu] Note: There are several ICU (and Java) whitespace functions. Comparison:

isUWhiteSpace=UCHAR_WHITE_SPACE: Unicode White_Space property; most of general categories "Z" (separators) + most whitespace ISO controls (including no-break spaces, but excluding IS1..IS4 and ZWSP)
isWhitespace: Java isWhitespace; Z + whitespace ISO controls but excluding no-break spaces
isSpaceChar: just Z (including no-break spaces)
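The difference between the last two rows of this comparison can be checked with the java.lang.Character methods of the same names. This sketch assumes a plain JDK and therefore does not cover isUWhiteSpace, which exists only on UCharacter; WhitespaceKinds is a hypothetical class name.

```java
final class WhitespaceKinds {
    /** Formats both Java whitespace tests for one code point. */
    static String describe(int cp) {
        return String.format("U+%04X isWhitespace=%b isSpaceChar=%b",
                cp, Character.isWhitespace(cp), Character.isSpaceChar(cp));
    }

    public static void main(String[] args) {
        System.out.println(describe(' '));    // both true
        System.out.println(describe('\t'));   // ISO control: isWhitespace only
        System.out.println(describe(0x00A0)); // no-break space: isSpaceChar only
        System.out.println(describe(0x2028)); // line separator (Zl): both true
    }
}
```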

This class is not subclassable.

See also:

UCharacterEnums

Summary

Nested classes
`interface`	`UCharacter.BidiPairedBracketType` Bidi Paired Bracket Type constants.
`interface`	`UCharacter.DecompositionType` Decomposition Type constants.
`interface`	`UCharacter.EastAsianWidth` East Asian Width constants.
`interface`	`UCharacter.GraphemeClusterBreak` Grapheme Cluster Break constants.
`interface`	`UCharacter.HangulSyllableType` Hangul Syllable Type constants.
`interface`	`UCharacter.JoiningGroup` Joining Group constants.
`interface`	`UCharacter.JoiningType` Joining Type constants.
`interface`	`UCharacter.LineBreak` Line Break constants.
`interface`	`UCharacter.NumericType` Numeric Type constants.
`interface`	`UCharacter.SentenceBreak` Sentence Break constants.
`class`	`UCharacter.UnicodeBlock` [icu enhancement] ICU's replacement for `Character.UnicodeBlock`. Methods, fields, and other functionality specific to ICU are labeled '[icu]'.
`interface`	`UCharacter.WordBreak` Word Break constants.

Constants
`int`	`FOLD_CASE_DEFAULT` [icu] Option value for case folding: use default mappings defined in CaseFolding.txt.
`int`	`FOLD_CASE_EXCLUDE_SPECIAL_I` [icu] Option value for case folding: Use the modified set of mappings provided in CaseFolding.txt to handle dotted I and dotless i appropriately for Turkic languages (tr, az).
`int`	`MAX_CODE_POINT` Constant U+10FFFF, same as `Character.MAX_CODE_POINT`.
`char`	`MAX_HIGH_SURROGATE` Constant U+DBFF, same as `Character.MAX_HIGH_SURROGATE`.
`char`	`MAX_LOW_SURROGATE` Constant U+DFFF, same as `Character.MAX_LOW_SURROGATE`.
`int`	`MAX_RADIX` Compatibility constant for Java Character's MAX_RADIX.
`char`	`MAX_SURROGATE` Constant U+DFFF, same as `Character.MAX_SURROGATE`.
`int`	`MAX_VALUE` The highest Unicode code point value (scalar value), constant U+10FFFF (uses 21 bits).
`int`	`MIN_CODE_POINT` Constant U+0000, same as `Character.MIN_CODE_POINT`.
`char`	`MIN_HIGH_SURROGATE` Constant U+D800, same as `Character.MIN_HIGH_SURROGATE`.
`char`	`MIN_LOW_SURROGATE` Constant U+DC00, same as `Character.MIN_LOW_SURROGATE`.
`int`	`MIN_RADIX` Compatibility constant for Java Character's MIN_RADIX.
`int`	`MIN_SUPPLEMENTARY_CODE_POINT` Constant U+10000, same as `Character.MIN_SUPPLEMENTARY_CODE_POINT`.
`char`	`MIN_SURROGATE` Constant U+D800, same as `Character.MIN_SURROGATE`.
`int`	`MIN_VALUE` The lowest Unicode code point value, constant 0.
`double`	`NO_NUMERIC_VALUE` Special value that is returned by getUnicodeNumericValue(int) when no numeric value is defined for a code point.
`int`	`REPLACEMENT_CHAR` Unicode value used when translating into Unicode encoding form and there is no existing character.
`int`	`SUPPLEMENTARY_MIN_VALUE` The minimum value for Supplementary code points, constant U+10000.
`int`	`TITLECASE_NO_BREAK_ADJUSTMENT` Do not adjust the titlecasing indexes from BreakIterator::next() indexes; titlecase exactly the characters at breaks from the iterator.
`int`	`TITLECASE_NO_LOWERCASE` Do not lowercase non-initial parts of words when titlecasing.

Inherited constants

From interface


    android.icu.lang.UCharacterEnums.ECharacterCategory

From interface


    android.icu.lang.UCharacterEnums.ECharacterDirection

Public methods
`static int`	`charCount(int cp)` Same as `Character.charCount(int)`.
`static final int`	`codePointAt(char[] text, int index, int limit)` Same as `Character.codePointAt(char[], int, int)`.
`static final int`	`codePointAt(char[] text, int index)` Same as `Character.codePointAt(char[], int)`.
`static final int`	`codePointAt(CharSequence seq, int index)` Same as `Character.codePointAt(CharSequence, int)`.
`static final int`	`codePointBefore(char[] text, int index)` Same as `Character.codePointBefore(char[], int)`.
`static final int`	`codePointBefore(CharSequence seq, int index)` Same as `Character.codePointBefore(CharSequence, int)`.
`static final int`	`codePointBefore(char[] text, int index, int limit)` Same as `Character.codePointBefore(char[], int, int)`.
`static int`	`codePointCount(CharSequence text, int start, int limit)` Equivalent to the `Character.codePointCount(CharSequence, int, int)` method, for convenience.
`static int`	`codePointCount(char[] text, int start, int limit)` Equivalent to the `Character.codePointCount(char[], int, int)` method, for convenience.
`static int`	`digit(int ch)` Returns the numeric value of a decimal digit code point.
`static int`	`digit(int ch, int radix)` Returns the numeric value of a decimal digit code point.
`static String`	`foldCase(String str, boolean defaultmapping)` [icu] The given string is mapped to its case folding equivalent according to UnicodeData.txt and CaseFolding.txt; if any character has no case folding equivalent, the character itself is returned.
`static int`	`foldCase(int ch, boolean defaultmapping)` [icu] The given character is mapped to its case folding equivalent according to UnicodeData.txt and CaseFolding.txt; if the character has no case folding equivalent, the character itself is returned.
`static int`	`foldCase(int ch, int options)` [icu] The given character is mapped to its case folding equivalent according to UnicodeData.txt and CaseFolding.txt; if the character has no case folding equivalent, the character itself is returned.
`static final String`	`foldCase(String str, int options)` [icu] The given string is mapped to its case folding equivalent according to UnicodeData.txt and CaseFolding.txt; if any character has no case folding equivalent, the character itself is returned.
`static char`	`forDigit(int digit, int radix)` Provide the java.lang.Character forDigit API, for convenience.
`static VersionInfo`	`getAge(int ch)` [icu] Returns the "age" of the code point.
`static int`	`getBidiPairedBracket(int c)` [icu] Maps the specified character to its paired bracket character.
`static int`	`getCharFromExtendedName(String name)` [icu] Finds a Unicode character by either its name or its extended name and returns its code point value.
`static int`	`getCharFromName(String name)` [icu] Finds a Unicode code point by its most current Unicode name and returns its code point value.
`static int`	`getCharFromNameAlias(String name)` [icu] Finds a Unicode character by its corrected name alias and returns its code point value.
`static int`	`getCodePoint(char char16)` [icu] Returns the code point corresponding to the BMP code point.
`static int`	`getCodePoint(char lead, char trail)` [icu] Returns a code point corresponding to the two surrogate code units.
`static int`	`getCombiningClass(int ch)` [icu] Returns the combining class of the argument codepoint
`static int`	`getDirection(int ch)` [icu] Returns the Bidirection property of a code point.
`static byte`	`getDirectionality(int cp)` Equivalent to the `Character.getDirectionality(char)` method, for convenience.
`static String`	`getExtendedName(int ch)` [icu] Returns a name for a valid codepoint.
`static ValueIterator`	`getExtendedNameIterator()` [icu] Returns an iterator for character names, iterating over codepoints.
`static int`	`getHanNumericValue(int ch)` [icu] Returns the numeric value of a Han character.
`static int`	`getIntPropertyMaxValue(int type)` [icu] Returns the maximum value for an integer/binary Unicode property.
`static int`	`getIntPropertyMinValue(int type)` [icu] Returns the minimum value for an integer/binary Unicode property type.
`static int`	`getIntPropertyValue(int ch, int type)` [icu] Returns the property value for a Unicode property type of a code point.
`static int`	`getMirror(int ch)` [icu] Maps the specified code point to a "mirror-image" code point.
`static String`	`getName(int ch)` [icu] Returns the most current Unicode name of the argument code point, or null if the character is unassigned or outside the range UCharacter.MIN_VALUE and UCharacter.MAX_VALUE or does not have a name.
`static String`	`getName(String s, String separator)` [icu] Returns the names for each of the characters in a string
`static String`	`getNameAlias(int ch)` [icu] Returns the corrected name from NameAliases.txt if there is one.
`static ValueIterator`	`getNameIterator()` [icu] Returns an iterator for character names, iterating over codepoints.
`static int`	`getNumericValue(int ch)` Returns the numeric value of the code point as a nonnegative integer.
`static int`	`getPropertyEnum(CharSequence propertyAlias)` [icu] Return the UProperty selector for a given property name, as specified in the Unicode database file PropertyAliases.txt.
`static String`	`getPropertyName(int property, int nameChoice)` [icu] Return the Unicode name for a given property, as given in the Unicode database file PropertyAliases.txt.
`static int`	`getPropertyValueEnum(int property, CharSequence valueAlias)` [icu] Return the property value integer for a given value name, as specified in the Unicode database file PropertyValueAliases.txt.
`static String`	`getPropertyValueName(int property, int value, int nameChoice)` [icu] Return the Unicode name for a given property value, as given in the Unicode database file PropertyValueAliases.txt.
`static int`	`getType(int ch)` Returns a value indicating a code point's Unicode category.
`static RangeValueIterator`	`getTypeIterator()` [icu] Returns an iterator for character types, iterating over codepoints.
`static double`	`getUnicodeNumericValue(int ch)` [icu] Returns the numeric value for a Unicode code point as defined in the Unicode Character Database.
`static VersionInfo`	`getUnicodeVersion()` [icu] Returns the version of Unicode data used.
`static boolean`	`hasBinaryProperty(int ch, int property)` [icu] Check a binary Unicode property for a code point.
`static boolean`	`isBMP(int ch)` [icu] Determines if the code point is in the BMP plane.
`static boolean`	`isBaseForm(int ch)` [icu] Determines whether the specified code point is of base form.
`static boolean`	`isDefined(int ch)` Determines if a code point has a defined meaning in the up-to-date Unicode standard.
`static boolean`	`isDigit(int ch)` Determines if a code point is a Java digit.
`static boolean`	`isHighSurrogate(char ch)` Same as `Character.isHighSurrogate(char)`.
`static boolean`	`isISOControl(int ch)` Determines if the specified code point is an ISO control character.
`static boolean`	`isIdentifierIgnorable(int ch)` Determines if the specified code point should be regarded as an ignorable character in a Java identifier.
`static boolean`	`isJavaIdentifierPart(int cp)` Compatibility override of Java method, delegates to java.lang.Character.isJavaIdentifierPart.
`static boolean`	`isJavaIdentifierStart(int cp)` Compatibility override of Java method, delegates to java.lang.Character.isJavaIdentifierStart.
`static boolean`	`isLegal(int ch)` [icu] A code point is illegal if and only if it is (1) out of bounds, that is, less than 0 or greater than UCharacter.MAX_VALUE; (2) a surrogate value, 0xD800 to 0xDFFF; or (3) a not-a-character, having the form 0x xxFFFF or 0x xxFFFE. Note: legal does not mean that it is assigned in this version of Unicode.
`static boolean`	`isLegal(String str)` [icu] A string is legal iff all its code points are legal.
`static boolean`	`isLetter(int ch)` Determines if the specified code point is a letter.
`static boolean`	`isLetterOrDigit(int ch)` Determines if the specified code point is a letter or digit.
`static boolean`	`isLowSurrogate(char ch)` Same as `Character.isLowSurrogate(char)`.
`static boolean`	`isLowerCase(int ch)` Determines if the specified code point is a lowercase character.
`static boolean`	`isMirrored(int ch)` Determines whether the code point has the "mirrored" property.
`static boolean`	`isPrintable(int ch)` [icu] Determines whether the specified code point is a printable character according to the Unicode standard.
`static boolean`	`isSpaceChar(int ch)` Determines if the specified code point is a Unicode-specified space character.
`static boolean`	`isSupplementary(int ch)` [icu] Determines if the code point is a supplementary character.
`static final boolean`	`isSupplementaryCodePoint(int cp)` Same as `Character.isSupplementaryCodePoint(int)`.
`static final boolean`	`isSurrogatePair(char high, char low)` Same as `Character.isSurrogatePair(char, char)`.
`static boolean`	`isTitleCase(int ch)` Determines if the specified code point is a titlecase character.
`static boolean`	`isUAlphabetic(int ch)` [icu] Check if a code point has the Alphabetic Unicode property.
`static boolean`	`isULowercase(int ch)` [icu] Check if a code point has the Lowercase Unicode property.
`static boolean`	`isUUppercase(int ch)` [icu] Check if a code point has the Uppercase Unicode property.
`static boolean`	`isUWhiteSpace(int ch)` [icu] Check if a code point has the White_Space Unicode property.
`static boolean`	`isUnicodeIdentifierPart(int ch)` Determines if the specified code point may be any part of a Unicode identifier other than the starting character.
`static boolean`	`isUnicodeIdentifierStart(int ch)` Determines if the specified code point is permissible as the first character in a Unicode identifier.
`static boolean`	`isUpperCase(int ch)` Determines if the specified code point is an uppercase character.
`static final boolean`	`isValidCodePoint(int cp)` Equivalent to `Character.isValidCodePoint(int)`.
`static boolean`	`isWhitespace(int ch)` Determines if the specified code point is a white space character.
`static int`	`offsetByCodePoints(CharSequence text, int index, int codePointOffset)` Equivalent to the `Character.offsetByCodePoints(CharSequence, int, int)` method, for convenience.
`static int`	`offsetByCodePoints(char[] text, int start, int count, int index, int codePointOffset)` Equivalent to the `Character.offsetByCodePoints(char[], int, int, int, int)` method, for convenience.
`static final int`	`toChars(int cp, char[] dst, int dstIndex)` Same as `Character.toChars(int, char[], int)`.
`static final char[]`	`toChars(int cp)` Same as `Character.toChars(int)`.
`static final int`	`toCodePoint(char high, char low)` Same as `Character.toCodePoint(char, char)`.
`static String`	`toLowerCase(String str)` Returns the lowercase version of the argument string.
`static int`	`toLowerCase(int ch)` The given code point is mapped to its lowercase equivalent; if the code point has no lowercase equivalent, the code point itself is returned.
`static String`	`toLowerCase(ULocale locale, String str)` Returns the lowercase version of the argument string.
`static String`	`toLowerCase(Locale locale, String str)` Returns the lowercase version of the argument string.
`static String`	`toString(int ch)` Converts argument code point and returns a String object representing the code point's value in UTF-16 format.
`static String`	`toTitleCase(Locale locale, String str, BreakIterator titleIter, int options)` [icu] Returns the titlecase version of the argument string.
`static String`	`toTitleCase(ULocale locale, String str, BreakIterator titleIter)` Returns the titlecase version of the argument string.
`static String`	`toTitleCase(String str, BreakIterator breakiter)` Returns the titlecase version of the argument string.
`static String`	`toTitleCase(ULocale locale, String str, BreakIterator titleIter, int options)` Returns the titlecase version of the argument string.
`static String`	`toTitleCase(Locale locale, String str, BreakIterator breakiter)` Returns the titlecase version of the argument string.
`static int`	`toTitleCase(int ch)` Converts the code point argument to titlecase.
`static String`	`toUpperCase(Locale locale, String str)` Returns the uppercase version of the argument string.
`static String`	`toUpperCase(ULocale locale, String str)` Returns the uppercase version of the argument string.
`static int`	`toUpperCase(int ch)` Converts the character argument to uppercase.
`static String`	`toUpperCase(String str)` Returns the uppercase version of the argument string.
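The three conditions listed for isLegal(int) in the table above can be written out as a short check. This is only a sketch of the rules as stated on this page, not ICU's implementation, which may apply further checks. LegalCodePoints is a hypothetical class, and java.lang.Character constants stand in for the UCharacter ones so that it runs on a plain JDK.

```java
final class LegalCodePoints {
    /** Applies the three "illegal" conditions from the isLegal(int) summary. */
    static boolean isLegal(int cp) {
        // (1) out of bounds
        if (cp < Character.MIN_CODE_POINT || cp > Character.MAX_CODE_POINT) {
            return false;
        }
        // (2) surrogate value, 0xD800 to 0xDFFF
        if (cp >= Character.MIN_SURROGATE && cp <= Character.MAX_SURROGATE) {
            return false;
        }
        // (3) low 16 bits are FFFE or FFFF
        return (cp & 0xFFFE) != 0xFFFE;
    }

    /** A string is legal if all of its code points are legal. */
    static boolean isLegal(String s) {
        // Unpaired surrogates are reported as surrogate code points and rejected.
        return s.codePoints().allMatch(LegalCodePoints::isLegal);
    }

    public static void main(String[] args) {
        System.out.println(isLegal(0x41));     // true
        System.out.println(isLegal(0xD800));   // false: surrogate
        System.out.println(isLegal(0x1FFFE));  // false: form xxFFFE
        System.out.println(isLegal("\uD800")); // false: unpaired surrogate
    }
}
```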

Inherited methods

From class


  
    java.lang.Object

`Object`	`clone()` Creates and returns a copy of this object.
`boolean`	`equals(Object obj)` Indicates whether some other object is "equal to" this one.
`void`	`finalize()` Called by the garbage collector on an object when garbage collection determines that there are no more references to the object.
`final Class<?>`	`getClass()` Returns the runtime class of this `Object`.
`int`	`hashCode()` Returns a hash code value for the object.
`final void`	`notify()` Wakes up a single thread that is waiting on this object's monitor.
`final void`	`notifyAll()` Wakes up all threads that are waiting on this object's monitor.
`String`	`toString()` Returns a string representation of the object.
`final void`	`wait(long millis, int nanos)` Causes the current thread to wait until another thread invokes the `notify()` method or the `notifyAll()` method for this object, or some other thread interrupts the current thread, or a certain amount of real time has elapsed.
`final void`	`wait(long millis)` Causes the current thread to wait until either another thread invokes the `notify()` method or the `notifyAll()` method for this object, or a specified amount of time has elapsed.
`final void`	`wait()` Causes the current thread to wait until another thread invokes the `notify()` method or the `notifyAll()` method for this object.

Constants

FOLD_CASE_DEFAULT

Added in API level 24

int FOLD_CASE_DEFAULT

[icu] Option value for case folding: use default mappings defined in CaseFolding.txt.

Constant Value: 0 (0x00000000)

FOLD_CASE_EXCLUDE_SPECIAL_I

Added in API level 24

int FOLD_CASE_EXCLUDE_SPECIAL_I

[icu] Option value for case folding: Use the modified set of mappings provided in CaseFolding.txt to handle dotted I and dotless i appropriately for Turkic languages (tr, az).

Before Unicode 3.2, CaseFolding.txt contained mappings marked with 'I' that are to be included for default mappings and excluded for the Turkic-specific mappings.

Unicode 3.2 CaseFolding.txt instead contains mappings marked with 'T' that are to be excluded for default mappings and included for the Turkic-specific mappings.

Constant Value: 1 (0x00000001)
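The dotted and dotless I distinction that this option addresses can be observed without ICU. The JDK has no case-folding API, so the sketch below uses locale-sensitive lowercasing instead; it illustrates the Turkic behavior only and is not a substitute for foldCase. TurkicI and lower are hypothetical names.

```java
import java.util.Locale;

final class TurkicI {
    /** Lowercases s under the rules of the given BCP 47 language tag. */
    static String lower(String s, String languageTag) {
        return s.toLowerCase(Locale.forLanguageTag(languageTag));
    }

    public static void main(String[] args) {
        // In Turkish, capital I pairs with dotless i (U+0131).
        System.out.println(lower("I", "tr").equals("\u0131"));
        // Capital I with dot above (U+0130) pairs with ordinary i.
        System.out.println(lower("\u0130", "tr").equals("i"));
        // Outside Turkic locales, capital I lowercases to ordinary i.
        System.out.println(lower("I", "en").equals("i"));
    }
}
```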

MAX_CODE_POINT

Added in API level 24

int MAX_CODE_POINT

Constant U+10FFFF, same as Character.MAX_CODE_POINT.

Constant Value: 1114111 (0x0010ffff)

MAX_HIGH_SURROGATE

Added in API level 24

char MAX_HIGH_SURROGATE

Constant U+DBFF, same as Character.MAX_HIGH_SURROGATE.

Constant Value: 56319 (0x0000dbff)

MAX_LOW_SURROGATE

Added in API level 24

char MAX_LOW_SURROGATE

Constant U+DFFF, same as Character.MAX_LOW_SURROGATE.

Constant Value: 57343 (0x0000dfff)

MAX_RADIX

Added in API level 24

int MAX_RADIX

Compatibility constant for Java Character's MAX_RADIX.

Constant Value: 36 (0x00000024)

MAX_SURROGATE

Added in API level 24

char MAX_SURROGATE

Constant U+DFFF, same as Character.MAX_SURROGATE.

Constant Value: 57343 (0x0000dfff)

MAX_VALUE

Added in API level 24

int MAX_VALUE

The highest Unicode code point value (scalar value), constant U+10FFFF (uses 21 bits). Same as MAX_CODE_POINT.

Up-to-date Unicode implementation of Character.MAX_VALUE, which is still a char with the value U+FFFF.

Constant Value: 1114111 (0x0010ffff)

MIN_CODE_POINT

Added in API level 24

int MIN_CODE_POINT

Constant U+0000, same as Character.MIN_CODE_POINT.

Constant Value: 0 (0x00000000)

MIN_HIGH_SURROGATE

Added in API level 24

char MIN_HIGH_SURROGATE

Constant U+D800, same as Character.MIN_HIGH_SURROGATE.

Constant Value: 55296 (0x0000d800)

MIN_LOW_SURROGATE

Added in API level 24

char MIN_LOW_SURROGATE

Constant U+DC00, same as Character.MIN_LOW_SURROGATE.

Constant Value: 56320 (0x0000dc00)

MIN_RADIX

Added in API level 24

int MIN_RADIX

Compatibility constant for Java Character's MIN_RADIX.

Constant Value: 2 (0x00000002)

MIN_SUPPLEMENTARY_CODE_POINT

Added in API level 24

int MIN_SUPPLEMENTARY_CODE_POINT

Constant U+10000, same as Character.MIN_SUPPLEMENTARY_CODE_POINT.

Constant Value: 65536 (0x00010000)

MIN_SURROGATE

Added in API level 24

char MIN_SURROGATE

Constant U+D800, same as Character.MIN_SURROGATE.

Constant Value: 55296 (0x0000d800)
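The surrogate and supplementary constants above fit together through the standard UTF-16 decoding arithmetic. The sketch below uses the java.lang.Character constants, which this page documents as having the same values, and compares the manual result with Character.toCodePoint. SurrogateMath and combine are hypothetical names.

```java
final class SurrogateMath {
    /** Combines a high (lead) and low (trail) surrogate into one code point. */
    static int combine(char high, char low) {
        return ((high - Character.MIN_HIGH_SURROGATE) << 10)
                + (low - Character.MIN_LOW_SURROGATE)
                + Character.MIN_SUPPLEMENTARY_CODE_POINT;
    }

    public static void main(String[] args) {
        char high = '\uD83D';
        char low = '\uDE00';
        System.out.println(Integer.toHexString(combine(high, low))); // 1f600
        // The library method gives the same answer.
        System.out.println(combine(high, low) == Character.toCodePoint(high, low));
        // The largest surrogates map to the largest code point, U+10FFFF.
        System.out.println(combine(Character.MAX_HIGH_SURROGATE, Character.MAX_LOW_SURROGATE)
                == Character.MAX_CODE_POINT);
    }
}
```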

MIN_VALUE

Added in API level 24

int MIN_VALUE

The lowest Unicode code point value, constant 0. Same as MIN_CODE_POINT, same integer value as Character.MIN_VALUE.

Constant Value: 0 (0x00000000)

NO_NUMERIC_VALUE

Added in API level 24

double NO_NUMERIC_VALUE

Special value that is returned by getUnicodeNumericValue(int) when no numeric value is defined for a code point.

See also:

getUnicodeNumericValue(int)

Constant Value: -1.23456789E8

REPLACEMENT_CHAR

Added in API level 24

int REPLACEMENT_CHAR

Unicode value used when translating into Unicode encoding form and there is no existing character.

Constant Value: 65533 (0x0000fffd)
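One common place where U+FFFD appears is when decoding malformed input. As a sketch on a plain JDK (not an ICU API), the String constructor that takes a Charset replaces malformed UTF-8 sequences with this same code point. ReplacementDemo and decodeUtf8 are hypothetical names.

```java
import java.nio.charset.StandardCharsets;

final class ReplacementDemo {
    /** Decodes bytes as UTF-8; malformed sequences become U+FFFD. */
    static String decodeUtf8(byte[] bytes) {
        return new String(bytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // 0xFF can never occur in well-formed UTF-8.
        String decoded = decodeUtf8(new byte[] {'A', (byte) 0xFF, 'B'});
        System.out.println(Integer.toHexString(decoded.codePointAt(1))); // fffd
    }
}
```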

SUPPLEMENTARY_MIN_VALUE

Added in API level 24

int SUPPLEMENTARY_MIN_VALUE

The minimum value for Supplementary code points, constant U+10000. Same as MIN_SUPPLEMENTARY_CODE_POINT.

Constant Value: 65536 (0x00010000)

TITLECASE_NO_BREAK_ADJUSTMENT

Added in API level 24

int TITLECASE_NO_BREAK_ADJUSTMENT

Do not adjust the titlecasing indexes from BreakIterator::next() indexes; titlecase exactly the characters at breaks from the iterator. Option bit for titlecasing APIs that take an options bit set. By default, titlecasing will take each break iterator index, adjust it by looking for the next cased character, and titlecase that one. Other characters are lowercased. This follows Unicode 4 & 5 section 3.13 Default Case Operations: R3 toTitlecase(X): Find the word boundaries based on Unicode Standard Annex #29, "Text Boundaries." Between each pair of word boundaries, find the first cased character F. If F exists, map F to default_title(F); then map each subsequent character C to default_lower(C).

Constant Value: 512 (0x00000200)

TITLECASE_NO_LOWERCASE

Added in API level 24

int TITLECASE_NO_LOWERCASE

Do not lowercase non-initial parts of words when titlecasing. Option bit for titlecasing APIs that take an options bit set. By default, titlecasing will titlecase the first cased character of a word and lowercase all other characters. With this option, the other characters will not be modified.

See also:

toTitleCase(ULocale, String, BreakIterator)

Constant Value: 256 (0x00000100)
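The default titlecasing behavior and the effect of TITLECASE_NO_LOWERCASE can be sketched with JDK classes. Everything here is an approximation under stated assumptions: java.text.BreakIterator stands in for the ICU break iterator, "cased" is approximated by the upper, lower, and titlecase tests, only simple per-code-point mappings are used, and the boolean parameter noLowercase is a hypothetical stand-in for the option bit. It is not ICU's toTitleCase.

```java
import java.text.BreakIterator;
import java.util.Locale;

final class TitlecaseSketch {
    /** Rough stand-in for the Unicode "cased" test. */
    private static boolean isCased(int cp) {
        return Character.isUpperCase(cp) || Character.isLowerCase(cp)
                || Character.isTitleCase(cp);
    }

    /** Titlecases the first cased code point in each word segment. */
    static String titlecase(String s, boolean noLowercase) {
        BreakIterator words = BreakIterator.getWordInstance(Locale.ROOT);
        words.setText(s);
        StringBuilder out = new StringBuilder(s.length());
        int start = words.first();
        for (int end = words.next(); end != BreakIterator.DONE;
                start = end, end = words.next()) {
            boolean seenFirstCased = false;
            for (int i = start; i < end; ) {
                int cp = s.codePointAt(i);
                i += Character.charCount(cp);
                if (!seenFirstCased && isCased(cp)) {
                    out.appendCodePoint(Character.toTitleCase(cp));
                    seenFirstCased = true;
                } else if (seenFirstCased && !noLowercase) {
                    out.appendCodePoint(Character.toLowerCase(cp)); // default behavior
                } else {
                    out.appendCodePoint(cp); // left unmodified
                }
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(titlecase("hELLO wORLD", false)); // Hello World
        System.out.println(titlecase("hELLO wORLD", true));  // HELLO WORLD
    }
}
```

With noLowercase set, only the first cased character of each word changes, which matches the description of TITLECASE_NO_LOWERCASE above.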

Parameters
`text`	`char[]`: the characters to check
`index`	`int`: the index of the first or only char forming the code point
`limit`	`int`: the limit of the valid text

Parameters
`seq`	`CharSequence`: the characters to check
`index`	`int`: the index of the first or only char forming the code point

Parameters
`text`	`CharSequence`: the characters to check
`start`	`int`: the start of the range
`limit`	`int`: the limit of the range

Parameters
`str`	`String`: the String to be converted
`defaultmapping`	`boolean`: Indicates whether the default mappings defined in CaseFolding.txt are to be used, otherwise the mappings for dotted I and dotless i marked with 'T' in CaseFolding.txt are included.

Parameters
`ch`	`int`: the character to be converted
`defaultmapping`	`boolean`: Indicates whether the default mappings defined in CaseFolding.txt are to be used, otherwise the mappings for dotted I and dotless i marked with 'T' in CaseFolding.txt are included.

Parameters
`ch`	`int`: the character to be converted
`options`	`int`: A bit set for special processing. Currently the recognised options are FOLD_CASE_EXCLUDE_SPECIAL_I and FOLD_CASE_DEFAULT

Parameters
`ch`	`int`: code point to test.
`type`	`int`: UProperty selector constant, identifies which binary property to check. Must be UProperty.BINARY_START <= type < UProperty.BINARY_LIMIT or UProperty.INT_START <= type < UProperty.INT_LIMIT or UProperty.MASK_START <= type < UProperty.MASK_LIMIT.

Parameters
`s`	`String`: string to format
`separator`	`String`: string to go between names

Parameters
`property`	`int`: UProperty selector.
`nameChoice`	`int`: UProperty.NameChoice selector for which name to get. All properties have a long name. Most have a short name, but some do not. Unicode allows for additional names; if present these will be returned by UProperty.NameChoice.LONG + i, where i=1, 2,...

Parameters
`property`	`int`: UProperty selector constant. UProperty.INT_START <= property < UProperty.INT_LIMIT or UProperty.BINARY_START <= property < UProperty.BINARY_LIMIT or UProperty.MASK_START <= property < UProperty.MASK_LIMIT. Only these properties can be enumerated.
`valueAlias`	`CharSequence`: the value name to be matched. The name is compared using "loose matching" as described in PropertyValueAliases.txt.

Parameters
`property`	`int`: UProperty selector constant. UProperty.INT_START <= property < UProperty.INT_LIMIT or UProperty.BINARY_START <= property < UProperty.BINARY_LIMIT or UProperty.MASK_START <= property < UProperty.MASK_LIMIT. If out of range, null is returned.
`value`	`int`: selector for a value for the given property. In general, valid values range from 0 up to some maximum. There are a few exceptions: (1.) UProperty.BLOCK values begin at the non-zero value BASIC_LATIN.getID(). (2.) UProperty.CANONICAL_COMBINING_CLASS values are not contiguous and range from 0..240. (3.) UProperty.GENERAL_CATEGORY_MASK values are mask values produced by left-shifting 1 by UCharacter.getType(). This allows grouped categories such as [:L:] to be represented. Mask values are non-contiguous.
`nameChoice`	`int`: UProperty.NameChoice selector for which name to get. All values have a long name. Most have a short name, but some do not. Unicode allows for additional names; if present these will be returned by UProperty.NameChoice.LONG + i, where i=1, 2,...

Parameters
`high`	`char`: the high (lead) char
`low`	`char`: the low (trail) char

Parameters
`cp`	`int`: the code point to convert
`dst`	`char[]`: the destination array into which to put the char(s) representing the code point
`dstIndex`	`int`: the index at which to put the first (or only) char

Parameters
`high`	`char`: the high (lead) surrogate
`low`	`char`: the low (trail) surrogate

Parameters
`locale`	`ULocale`: the locale in which the string is to be converted
`str`	`String`: the source string to convert

Parameters
`locale`	`Locale`: the locale in which the string is to be converted
`str`	`String`: the source string to convert

Parameters
`str`	`String`: the source string to convert
`breakiter`	`BreakIterator`: break iterator to determine the positions in which the character should be title cased.
