Radical definition and dictionary creation for a handwriting recognition system

The system described herein automatically defines a set of radicals to be used in a Kanji character handwriting recognition system and automatically creates a dictionary of the Kanji characters that are recognized by the system. In performing its functionality, the system described herein first obtains representative handwriting samples for each Kanji character that is to be recognized by the system. The system described herein then evaluates the samples to identify a set of subparts ("radicals") that are common to at least two of the Kanji characters. These radicals represent component roots from which the characters are formed. Each Kanji character is formed by one or more of these radicals. The radicals that are identified by the system described herein are not constrained to any preset definition (e.g., the traditional set of radicals used to organize Japanese dictionaries). Thus, the radicals utilized by the system described herein may include some of the traditional radicals or may include none of the traditional radicals. After identifying the set of radicals, the system described herein generates a dictionary with a mapping of each Kanji character that is to be recognized by the system to its component radicals. After the set of radicals and the dictionary have been created, these components can be utilized during handwriting recognition. When performing handwriting recognition, the system described herein identifies the radicals within the handwriting and then uses the mapping to determine which Kanji character the handwriting most closely matches.
El sistema descrito adjunto define automáticamente un sistema de radicales que se utilizarán en un sistema del reconocimiento del cursivo del carácter del kanji y crea automáticamente un diccionario de los caracteres del kanji que son reconocidos por el sistema. En la ejecución de su funcionalidad, el sistema descrito adjunto primero obtiene las muestras representativas del cursivo para cada carácter del kanji que deba ser reconocido por el sistema. El sistema descrito adjunto entonces evalúa las muestras para identificar un sistema de los subparts ("radicales") que son comunes por lo menos a dos de los caracteres del kanji. Estos radicales representan las raíces componentes de las cuales se forman los caracteres. Cada carácter del kanji es formado por uno o más de estos radicales. Los radicales que son identificados por el sistema descrito adjunto no se obligan a cualesquiera a preestablecer la definición (e.g., el sistema tradicional de radicales usados para organizar los diccionarios japoneses). Así, los radicales utilizados por el sistema descrito adjunto pueden incluir algunos de los radicales tradicionales o no pueden incluir ningunos de los radicales tradicionales. Después de identificar el sistema de radicales, el sistema descrito adjunto genera un diccionario con traz de cada carácter del kanji que deba ser reconocido por el sistema a sus radicales componentes. Después de que el sistema de radicales y el diccionario se hayan creado, estos componentes se pueden utilizar durante el reconocimiento del cursivo. Al realizar el reconocimiento del cursivo, el sistema descrito adjunto identifica los radicales dentro del cursivo y después utiliza traz para determinarse qué carácter del kanji empareja el cursivo lo más de cerca posible.

Web www.patentalert.com

< (none)

< Confidence measures using sub-word-dependent weighting of sub-word confidence scores for robust speech recognition

> Skipped-state method for mouse encoding

> (none)

~ 00065