US20140288924A1 - Systems and methods for an automated personalized dictionary generator for portable devices - Google Patents
Systems and methods for an automated personalized dictionary generator for portable devices Download PDFInfo
- Publication number
- US20140288924A1 US20140288924A1 US14/300,174 US201414300174A US2014288924A1 US 20140288924 A1 US20140288924 A1 US 20140288924A1 US 201414300174 A US201414300174 A US 201414300174A US 2014288924 A1 US2014288924 A1 US 2014288924A1
- Authority
- US
- United States
- Prior art keywords
- word
- words
- dictionary
- profanities
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 87
- 230000003068 static effect Effects 0.000 claims description 22
- 238000012545 processing Methods 0.000 claims description 10
- 238000004891 communication Methods 0.000 abstract description 6
- 239000000463 material Substances 0.000 abstract description 3
- 230000002708 enhancing effect Effects 0.000 abstract 1
- 230000008569 process Effects 0.000 description 59
- 230000000153 supplemental effect Effects 0.000 description 20
- 238000010586 diagram Methods 0.000 description 12
- 230000008901 benefit Effects 0.000 description 9
- 230000002250 progressing effect Effects 0.000 description 9
- 238000007619 statistical method Methods 0.000 description 6
- 238000000605 extraction Methods 0.000 description 4
- 230000009286 beneficial effect Effects 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 230000009471 action Effects 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 238000004321 preservation Methods 0.000 description 2
- 230000001755 vocal effect Effects 0.000 description 2
- 241001672694 Citrus reticulata Species 0.000 description 1
- 241001610351 Ipsa Species 0.000 description 1
- 244000141359 Malus pumila Species 0.000 description 1
- 240000000220 Panda oleosa Species 0.000 description 1
- 235000016496 Panda oleosa Nutrition 0.000 description 1
- 241000220324 Pyrus Species 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 235000021016 apples Nutrition 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000005352 clarification Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000002996 emotional effect Effects 0.000 description 1
- 230000006397 emotional response Effects 0.000 description 1
- 230000002650 habitual effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000036651 mood Effects 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 235000021017 pears Nutrition 0.000 description 1
- 230000036316 preload Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 238000013515 script Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Classifications
-
- G06F17/2735—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/237—Lexical tools
- G06F40/242—Dictionaries
-
- G06F17/276—
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/274—Converting codes to words; Guess-ahead of partial word inputs
Definitions
- This invention relates generally to generation of a personalized dictionary for portable devices. More particularly, the present invention relates to a method for populating a personalized dictionary in a semi automated fashion. This is achieved through the analysis of communication messages written, spoken, sent or received on the portable device.
- Text may include any written characters, or transcriptions of verbal messages.
- Such text or verbal message may include text using Roman based alphabets, Chinese alphabet, Arabic scripts, or virtually any known language's symbology.
- Mobile devices typically have less processing power and storage resources available than a stationary computer system. Additionally, due to the small size of these personal appliances, keypads are typically very small or require multiple keytaps. This small, highly portable size of the devices that enable mobile text connectivity also render the input of such text onerous.
- typical personal portable appliances may include utilities that facilitate the generation or entry of textual material for messaging purposes.
- these utilities may be one of several types, or some combination, including: i) systems which allow a user to enter text letter by letter using a scheme where a letter on a key is specifically identified in a deterministic fashion commonly called multi-tap systems, and ii) systems which match sequences of keys to word possibilities either algorithmically or by matching pre-stored dictionary entries, and iii) fully deterministic systems having a one to one correspondence to desired symbols such as a full keyboard, albeit miniaturized. These latter systems, of course, allow complete flexibility of symbol string entry.
- the first is a static dictionary which is formulated from a substantial corpus in the target language.
- static dictionaries may additionally be referred to as a static element, base dictionary, first dictionary or static word list.
- base dictionary In the initial use of the appliance, the performance of the utility is dominated by this static element.
- static dictionary may be changed in some modern appliances, such static dictionaries are, at best, quasi-static since changing content may confuse or distract the user and may confound manufacturer support activities.
- the second dictionary component is a used word listing that may have an associated ordering algorithm.
- a used word list may additionally be referred to as a used word dictionary, usage dictionary, second dictionary or common word list.
- a used word dictionary is helpful in that words and text constructs peculiar to that user are saved. Since a user tends, by and large, to use words and structures that have become habitual, and thus personal to the user, intended words may be predicted based upon the usage patterns established. This is believed to speed system response, generally, since users tend to re-use certain words and it is far better to keep a separate entry list than to attempt to manage the full dictionary; again system support is eased if the primary dictionary is kept fairly static.
- a third list may be present that allows a user to create words that may be absent from the primary dictionary.
- Such a third list may additionally be referred to as a supplemental element, supplemental dictionary, third dictionary or supplemental word list.
- the supplemental dictionary allows preservation of the root dictionary whilst permitting a personal list of items, such as proper names or terms of art, relevant to a particular user to be stored.
- Another current method of addressing such an issue is to attempt to preload dictionary sets so that the user has fewer words to manually input. This has been met with mixed success, since such predetermined lists are very costly and difficult to compile, and are often non-reflective of what terms and words the user desires to use.
- the current lack of rapid dictionary population may be inadequate as requiring too much manual attention from the users, or requiring too much storage for exhaustive dictionary sets.
- Manufacturers and retailers of mobile devices would benefit greatly from the ability to offer devices with accurate and rapid dictionary word population. Additionally, users of these mobile devices would benefit greatly by having reduced aggravation and more efficiency when initially inputting text on the mobile device.
- the current invention aids in automating, at least in part, the creation of the supplemental dictionary.
- a considerable benefit is that caller name records may be built rapidly as may be terms of art, thus freeing the user from the laborious task of creating each entry one by one.
- a method and system for automated dictionary population is provided. Such a system is useful for a user of mobile devices to efficiently produce text data yet avoid much of the laborious task of deterministically entering every new word for storage and future use.
- the mobile device may include at least one dictionary which includes entries. Every time the device receives a communication intended for the user, the information may be parsed and textual data extracted. The text is then compared to the entries of the dictionaries to identify new words. Statistical information may be generated for the parsed words. This information includes word usage frequency, recency, or likelihood of use.
- Profanities may be processed by identifying profanities within the parsed words by comparing the parsed words to a profanity word list, modifying the profanities by replacing at least some of the profanity with a place marker and displaying the modified profanity to a user in a candidate list. Then the user may be asked to provide feedback either selecting or deselecting the profanity. Selecting the profanity results in displaying the profanity to the user and storing the profanity. De-selecting the profanity removes the profanity from the candidate list.
- phrases from the parsed words may be identified by phrase markers, which may include at least one of italicized word groups, quoted word groups, bolded word groups, capitalized word groups, word groups containing more than one new word, and groups of words including joining words.
- the new words may be stored in a supplementary dictionary or word list. These words may be stored as single words or may be stored utilizing linking the words of the identified phrases to preserve any phrase relationships. This is valuable in the case of certain professions where a phrase may be a term of art and the individual words may be less useful when used alone. Likewise, the statistical information may be stored.
- Voice data may also be processed and harvested for word samples in the same way.
- voice messages may be machine converted to textual form external to the mobile appliance and submitted to the appliance using the GSM short message service or similar service.
- the stored data harvested in these ways allows relevant candidates to be shown to the user more frequently than those extracted from a static dictionary constructed from corpora having a broader or more general statistical bias. Moreover, by combining words that are related to form phrases, it has been found that a substantial improvement in the candidate quality and a reduction in required keystrokes is usual.
- FIG. 1 shows a logical block diagram of an automated dictionary population system in accordance with an embodiment of the present invention
- FIG. 2 shows a logical block diagram of a dictionary set for the automated dictionary population system of FIG. 1 ;
- FIG. 3 shows a logical block diagram of a processor for the automated dictionary population system of FIG. 1 ;
- FIG. 4 shows a logical block diagram of a word extractor for the automated dictionary population system of FIG. 1 ;
- FIG. 5 shows a logical block diagram of a statistical for the automated dictionary population system of FIG. 1 ;
- FIG. 6 shows a logical block diagram of a phrasing analyzer for the automated dictionary population system of FIG. 1 ;
- FIG. 7 shows an illustration of a mobile device in conjunction with a communication network in accordance with an embodiment of the present invention
- FIG. 8 shows an illustration of an ambiguous style keypad associated with the mobile device in accordance with an embodiment of the present invention
- FIG. 9 shows an illustration of a deterministic style keypad associated with the mobile device in accordance with an embodiment of the present invention.
- FIG. 10 shows a flow chart illustrating a process of automated dictionary population in accordance with an embodiment of the present invention
- FIG. 11 shows a flow chart illustrating a process of message processing in accordance with an embodiment of the present invention
- FIG. 12 shows a flow chart illustrating a process of word extraction in accordance with an embodiment of the present invention
- FIG. 13 shows a flow chart illustrating a process of profanity interruption in accordance with an embodiment of the present invention
- FIG. 14 shows a flow chart illustrating a process of statistical analysis of words in accordance with an embodiment of the present invention
- FIG. 15 shows a flow chart illustrating a process of analysis for word groups in accordance with an embodiment of the present invention.
- FIG. 16 shows a flow chart illustrating a process of identifying phrase groups in accordance with an embodiment of the present invention.
- the present invention relates generally to semi automated dictionary population system and method to provide fast and efficient dictionary generation and personalization for mobile devices (also known as a personal appliance). More particularly, the present invention relates to a method for dictionary population that requires fewer storage resources and less distracting inputs from the user.
- FIG. 1 shows a logical block diagram of an Automated Dictionary Population System 100 .
- the Automated Dictionary Population System 100 may include a User 101 which interacts with a Dictionary System 110 . Additionally, the Dictionary System 110 may, in some embodiments, interface an External Wireless Network 103 . The Dictionary System 110 may, in some embodiments, provide population of dictionaries.
- the Dictionary System 110 may include an Interface 111 , a Message Storage 113 , a Dictionary Set 115 , a Processor 117 and a Wireless Connector 121 .
- the Interface 111 may enable the User 101 to interact with the Dictionary System 110 .
- the Wireless Connector 121 may enable the Dictionary System 110 to access the External Wireless Network 103 .
- the External Wireless Network 103 may include a Wide Area Network (WAN) such as the internet, a cellular phone network, another device such as one's personal computer, or any desired data source.
- WAN Wide Area Network
- the External Wireless Network 103 may enable the transfer of text data from the Dictionary System 110 to other devices for delivery to the intended recipients.
- Dictionary System 110 may be contained within a mobile device such as a Personal Digital Assistant (PDA), cellular phone, computerized organizer, personal computer, Blackberry or similar device, as is well known by those skilled in the art. While the disclosed invention is, in some embodiments, shown for use by mobile devices, the present invention is not intended to be limited to devices that are mobile. For example, in some embodiments, the present invention may be utilized upon a standard desktop computer, cash register, land line telephone, or any text capable device.
- PDA Personal Digital Assistant
- cellular phone computerized organizer
- personal computer personal computer
- Blackberry Blackberry or similar device
- the User 101 is not required for the Automated Dictionary Population System 100 .
- the Dictionary System 110 may perform dictionary population without receiving input from the User 101 .
- Interface 111 may be a keypad, touch screen, stylus pad, or any input device. Additionally, in some embodiments, Interface 111 may also provide an output such as a screen or sound output. Alternate systems of input and output may be utilized by the Interface 111 as is well known by those skilled in the art. The Interface 111 facilitates input from the User 101 to the Processor 117 .
- Messages provided by the User 101 through the Interface 111 may be stored by the Message Storage 113 .
- messages received by the mobile device from the External Wireless Network 103 via the Wireless Connector 121 may, likewise, be stored by the Message Storage 113 .
- the Message Storage 113 may additionally be referred to as an ‘inbox’ or similar term.
- the Message Storage 113 is of finite size, although that size may be very large in a modern mobile device. Messages may be deleted when the User 101 has no further need of them or may be deleted automatically when a time limit is reached. Regardless of the actual mechanism, Message Storage 113 contents may be regarded as temporary in nature.
- the Dictionary Set 115 may include the static root, or first, dictionary as well as user populated dictionaries, including the supplemental word list, i.e. the dictionaries being populated by the present invention.
- the supplementary word lists may be stored as a single list which may be considered to be a ‘used word’ list. Otherwise, these supplementary word lists may be stored as one or more separate word lists, each having a reference entry that allows access to these particular lists only during text exchanges which use at least some of the terms or words stored therein.
- a message sent to John Smith could search not only the main dictionary and the personal word list, but also a used word list and a list of words used in messages received from John Smith.
- a list of all received words is kept and is accessible from any application where text entry is used. Details of the architecture of the Dictionary Set 115 will be provided below.
- Duplication of words is wasteful; storing the same word more than once outside the main dictionary is not necessary.
- an advantage is that the word is accessed earlier because it has become more frequently used than might be implied from the main dictionary. It is thus beneficial to store pointers to words in order to control memory usage, and also allowing phrases to be constructed by directing to particular words regardless of their actual location.
- the Processor 117 may perform the analysis and computations required to populate the Dictionary Set 115 . Upon initial startup, the Processor 117 may sequentially read each message and extract every word contained in these messages. This extracted word list is then stored as a supplementary dictionary list in the Dictionary Set 115 . Each time thereafter, when a message is received, the text from that received message is extracted and parsed and the words are added to this dictionary. In some cases, words will be repeats of those already stored in the main dictionary. Details of the architecture of the Processor 117 will be provided below.
- GSM short message service there are several methods of handling received messages. Normal messages which contain displayable text may be presented for the User 101 on command, and read in the normal fashion. Other messages may be sent which contain machine level instructions for the receiving device and allow User 101 action to cause certain transactions that are not normal messaging transactions.
- This invention is mainly concerned with readable messages intended for the User 101 . It is also the case that electronic mail has the same essential characteristics; and, in fact, any messaging application can be treated in the same way by the Automated Dictionary Population System 100 .
- the message When a message is opened to be read by the User 101 , the message may be parsed and a temporary list of words may be created. Each word is tested to see if it is already stored in the used word dictionary. Since there is no need to duplicate the word if it has already been stored in the used word dictionary, such repeat words may be discarded. If a word is not found in the used word dictionary, it may be appended to the list so that the list extends downwards with the last entries at the end. This feature may be beneficially used to search recent entries.
- the Automated Dictionary Population System 100 may be enabled to group phrases so that components of terms of art may be stored.
- medical terms and legal terms routinely use word groups; as an example, consider terms such as res ipsa loquitur and mutatis mutandis where neither term is best stored as separate parts.
- each term may be fabricated from a string of single words, it is advantageous if the words that make up the terms are linked. Medical terms are notoriously lengthy and similarity between words may convey entirely the wrong information. In this case, linkage between words may be even more beneficial.
- FIG. 2 shows a logical block diagram of the Dictionary Set 115 for the Automated Dictionary Population System 100 of FIG. 1 .
- the Dictionary Organizer 201 may provide organization for the Dictionary Set 115 as well as coupling the Dictionary Set 115 to the other components of the Dictionary System 110 , as illustrated by a Cloud 200 .
- the Dictionary Set 115 may also include a Static Dictionary 211 , a Used Word List 213 , a Supplemental Dictionary 215 , and a Profanity List 217 . In some embodiments, more or fewer dictionary partitions may be included within the Dictionary Set 115 .
- each dictionary within the Dictionary Set 115 may be further subdivided into sub-dictionary lists.
- the Supplemental Dictionary 215 may be divided into multiple supplemental word lists, accessible only when addressing a particular recipient or when discussing terms found in such a list.
- the Static Dictionary 211 may be referred to as the first dictionary, root dictionary, original dictionary, or base word list.
- the content of Static Dictionary 211 is typically preloaded by the manufacturer of the mobile device.
- the Static Dictionary 211 is typically not amendable by the User 101 .
- the Static Dictionary 211 may be formulated from a substantial corpus in the target language, and may contain any number of words, dependent upon manufacturer desires and availability of storage resources. However, in many current mobile devices, the Static Dictionary 211 may include a corpus of approximately 10,000 to 100,000 words on average.
- the Used Word List 213 may be populated by words that have been used by the User 101 or received by the Dictionary System 110 via the External Wireless Network 103 .
- the Used Word List 213 may then be appended as additional words are received.
- the Used Word List 213 may have an associated ordering algorithm.
- words are not duplicated within the Used Word List 213 and Static Dictionary 211 . Instead, a reference is placed within the Used Word List 213 to the word found in the Static Dictionary 211 .
- multiple usages of particular words will not result in duplication within the Used Word List 213 , but rather, each word within the Used Word List 213 may include a counter to track frequency of use. Such usage tracking may be utilized to provide predictions of words to the User 101 during message composition.
- Frequency and Recency are the two elements that may be used to force an order to the assorted lists. These two elements are both embodied in the concept of ‘likelihood’. Usage frequency need not be any absolute numerical value. In some embodiments, it suffices to store data representative of relative frequency. In the minimal form, list ordering may be used to imply relative frequency. Moreover, since recency is also a valuable index of likelihood, this too may be used as a parameter.
- the Supplemental Dictionary 215 may be a particular type of used word list. As such, in some embodiments, the Used Word List 213 and Supplemental Dictionary 215 may, in fact, be one and the same. However, due to the particular structure desired for the Supplemental Dictionary 215 , in some embodiments, it has been distinguished as a separate component of the Dictionary Set 115 . For example, it may be beneficial to separate the organization of certain words based on the symbol set or font detail.
- the Supplemental Dictionary 215 enables preservation of the Static Dictionary 211 whilst permitting a personal list of items such as proper names or terms of art relevant to a particular User 101 to be stored.
- the method of dictionary population disclosed by this invention involves the generation and promulgation of the Supplemental Dictionary 215 .
- the Supplemental Dictionary 215 may be stored as a single list. Otherwise, the Supplemental Dictionary 215 may be stored as one or more separate word lists each having a reference entry that allows certain ones of these lists to be accessed only with text exchanges which use these terms or words.
- the Dictionary Set 115 may also include a Profanity List 217 .
- the Profanity List 217 enables profanities and expletives to be identified. Profanities may be determined by community, or target consumer standards. Profanities may include words and phrases native to the user's language, as well as commonly used slang or foreign profanities. In some embodiments, context of the word may likewise be analyzed to determine if its usage is deemed profane. The Automated Dictionary Population System 100 may then resolve the use of the profanity whereby the User 101 is not overly inconvenienced, or offended.
- the Dictionary Set 115 may also include a Frequently Misspelled Word List 219 .
- the Frequently Misspelled Word List 219 enables identification of misspelled words so that these words are not used to populate the dictionary.
- the difficulty caused by improper spelling may be resolved through the use of the Frequently Misspelled Word List 219 and dictionary error distance calculations. Error distance may be calculated for words, and those which have low error distances may be used to estimate which candidates are most likely to have been intended. Although this may prove disruptive to a user in the early stages, a simple query may be presented that allows the removal of erroneously stored words. This may be resolved simply by marking the word or word group when they are retrieved as candidates.
- misspelled word recieve would appear italicized or otherwise distinguished in addition to the correctly spelled word. Selection of a seemingly misspelled word would confirm its probable valid status and promote its likelihood of retrieval whereas non-selection would demote it. Automatic removal is possible but must be approached with great care. Capitalized words, in some embodiments, should not be routinely eliminated.
- FIG. 3 shows a logical block diagram of the Processor 117 for the Automated Dictionary Population System 100 of FIG. 1 .
- the Coupler 301 may couple the Processor 117 to the other components of the Dictionary System 110 , as illustrated by the Cloud 200 .
- the Processor 117 may additionally include a Word Extractor 311 , a Dictionary Comparer 313 , a Profanity Interrupter 315 , a Statistical Engine 317 and a Word Storage Moderator 319 .
- the Word Extractor 311 parses the messages and extracts words, where the Dictionary Comparer 313 then compares the extracted words to those already stored within the Dictionary Set 115 . If a profanity is identified, the Profanity Interrupter 315 may perform an interruption to resolve the profanity.
- the Statistical Engine 317 may provide word prediction during text entry, as well as the ability to determine phrases through identification of joined words.
- the Word Storage Moderator 319 may direct the storage of new words within the Dictionary Set 115 .
- FIG. 4 shows a logical block diagram of the Word Extractor 311 for the Processor 117 of FIG. 3 .
- the Word Extractor 311 may include a Retriever 411 and a Message Parser 413 coupled to one another. Likewise, the Retriever 411 and Message Parser 413 may couple to the other components of the Processor 117 , as illustrated by the Cloud 400 .
- the Retriever 411 may retrieve messages from the Message Storage 113 for analysis. Retrieval may be automated by a trigger, or by timing. For example, retrieval may occur when the User 101 opens a message for viewing. In this way the Dictionary System 110 may gain feedback from the User 101 in instances where clarification is desired. In some embodiments, message processing may be deferred when available power is below a certain threshold and a large amount of data may be present for processing. In such an instance, a particular message may be saved for later if User 101 feedback is desired. Dispute resolution may then be achieved through user intervention.
- the Message Parser 413 may parse the message into individual words for the extraction. In some embodiments, the Message Parser 413 may also be configured to identify indicators of a phrase. In these embodiments, the Message Parser 413 may parse the individual words of the suspected phrase, as well as parse the entire intact phrase for analysis by the Statistical Engine 317 .
- FIG. 5 shows a logical block diagram of the Statistical Engine 317 for the Processor 117 of FIG. 3 .
- the Statistical Engine Coupler 501 may couple the Statistical Engine 317 to the other components of the Processor 117 , as illustrated by the Cloud 400 .
- the Statistical Engine 317 may additionally include a Phrasing Analyzer 511 , a Referencer 513 , a Word Frequency Tracker 515 , a Recipient Analyzer 517 and a Predictor 519 each coupled to one another.
- the Word Frequency Tracker 515 may include tracking word use frequency and word recency.
- the Phrasing Analyzer 511 may take the parsed language generated by the Message Parser 413 and identify the phrases. The Phrasing Analyzer 511 may also link particular words for later predictive processes.
- the Predictor 519 predicts words for the creation of candidate word lists.
- the Predictor 519 may use fuzzy logic in order to select the candidate word lists. Fuzzy logic is derived from fuzzy set theory dealing with reasoning that is approximate rather than precisely deduced from classical predicate logic. It can be thought of as the application side of fuzzy set theory dealing with well thought out real world expert values for a complex problem.
- the Referencer 513 may reference words already located within the Dictionary Set 115 , thereby eliminating the need for duplicate storage of words.
- the Word Frequency Tracker 515 may keep track of the frequency of word usage. Again, by tracking frequency, multiple uses of a single word will still result in a single word entry within the Dictionary Set 115 , thus saving storage resources. Also, the frequency of word use may be utilized by the Predictor 519 to generate candidate lists for the User 101 during text entry.
- the Word Frequency Tracker 515 may compile simple indicia of a word's gross usage and recency. By appending to a list, recency steps occur naturally, since when indexed from the end, backwards, the most recent words are identified.
- this word can be de-referenced and, at a convenient time, the list may be shuffled or compacted to eliminate the earlier instance of a recent word. If needed, the list may be augmented by keeping a note of how often the word has been used.
- the Word Frequency Tracker 515 may provide a more detailed and useful analysis of frequency. For example, more advanced versions of the Word Frequency Tracker 515 may provide word frequency use when the message is directed toward a particular recipient. Likewise, in some embodiments, the Word Frequency Tracker 515 may generate multiple frequency indicia for a word, dependent upon the preceding word(s), general message content, sentence grammar or other variable. In this way, the Word Frequency Tracker 515 may generate a rich set of frequency statistics for a more refined, and ultimately more useful, word prediction by the Predictor 519 . Complexity of the Word Frequency Tracker 515 may depend upon manufacturer's desires, and may consider storage and computation resources available to the Dictionary System 110 .
- the Recipient Analyzer 517 may analyze message recipient to generate data regarding word usage frequency by, or to, each recipient, and to also aid in the generation of recipient specific supplemental word lists of the Supplemental Dictionary 215 .
- FIG. 6 shows a logical block diagram of Phrasing Analyzer 511 for the Statistical Engine 317 of FIG. 5 .
- the Phrasing Analyzer 511 may include a Phrase Group Identifier 611 and a Linker 613 coupled to one another.
- the Phrase Group Identifier 611 and Linker 613 may couple to the other components of the Statistical Engine 317 , as illustrated by the Cloud 600 .
- the Phrase Group Identifier 611 may identify words strings which form phrases.
- the Linker 613 may provide links for the words of the phrase so that the actual phrase need not be stored in its entirety.
- adjacent words When a group of adjacent words is parsed from the text message, if none are to be found in the main dictionary, they may be stored with additional information that allows them to remain linked. Two or more adjacent words may form a phrase or term of art or an associated word group. Words which precede or follow the group will generally be found in the main dictionary. Phrases may also be identified by explicit mean such as capitalization, quotation marks surrounding the phrase, by underlining or marking in a distinct way. This latter is common in Chinese; for example, where characters that are intended to be read as a single phrase, such as a name, may be underlined and thus conjoined. In alphabetic based languages, it is common to find joining words such as of or in, used along with capitalized words.
- the Phrasing Analyzer 511 may receive Cost of Goods or Moreton-in-Marsh from the Message Parser 413 , and the Phrasing Analyzer 511 may be configured to identify these word groups as related or associated word structures. These associations between words may be stored in a way that enables them to be easily recalled by the User 101 .
- the word “and” is a strong joining word feature.
- the Cockney dialect of English has a strong “rhyming slang” format; “apples and pears” is used to substitute for “stairs” whereas “trouble and strife” would be used to mean “wife”.
- semantic rules it may be possible to detect relationships of this nature between words in a message.
- any capitalized words separated by known “joining” words may be treated as a group.
- FIG. 7 shows an illustration of a user interaction with a wireless mobile device, shown generally at 700 .
- the User 101 is seen interacting with a Dictionary System 110 , which is, in this exemplary illustration, a mobile device.
- the Dictionary System 110 includes a Display 713 , Keypad 715 and Microphone 717 , which collectively comprise the Interface 111 of the Dictionary System 110 .
- the Keypad 715 in the exemplary illustration may include a non-deterministic, or ambiguous, keypad, or a deterministic style keypad.
- the Dictionary System 110 may be coupled, wirelessly, to the External Wireless Network 103 via a Wireless Receiver 705 .
- the Wireless Receiver 705 may include a Bluetooth adapter, radio tower, access point, or any other wireless signal intermediary.
- the Dictionary System 110 may rely upon a wired connection to couple to the External Wireless Network 103 .
- the intent of these exemplary illustrations, as seen in FIG. 7 is to show an exemplary variety of device configurations that the Automated Dictionary Population System 100 is designed for.
- FIG. 8 shows an illustration of an Ambiguous Style Keypad 800 associated with many mobile devices. Such a Keypad 800 may be often found upon phones and other devices with limited key space.
- each Numerical Key 810 , 820 , 830 , 840 , 850 , 860 , 870 , 880 , 890 contains both a Numeral 811 , 821 , 831 , 841 , 851 , 861 , 871 , 881 , 891 , and a set of three or four Letters 812 , 822 , 832 , 842 , 852 , 862 , 872 , 882 , 892 .
- the Letters 812 , 822 , 832 , 842 , 852 , 862 , 872 , 882 , 892 may be that of any language desired and is not limited to the Roman alphabet.
- the non-numeric Keys 801 , 802 and 803 may likewise include characters and symbols, such as punctuation and spaces.
- the Ambiguous Keypad 800 may rely upon the number of times any particular Numerical Key 810 , 820 , 830 , 840 , 850 , 860 , 870 , 880 , 890 is pressed to generate a specific letter, or character. Alternatively, in some embodiments, the device may interpret a string of key hits and disambiguate the intended letters. Lastly, in some embodiments, a combined system of multiple key hits and disambiguation may be utilized for text entry into an Ambiguous Keypad 800 .
- FIG. 9 shows an illustration of the Deterministic Keypad 715 , or “full” keyboard, wherein the numerical inputs share a physical key with alphabetical inputs.
- the Deterministic Keypad 715 has one symbol per letter in the Latin set, and 12 keys are labeled with the numbers 0 through 9 and the characters * and # to correspond with the normal touch tone keys.
- Dualistic Keys 988 , 989 , 990 , 991 , 992 , 993 , 994 , 995 , 996 , 997 998 and 999 each provide numeric and alphabetic input.
- the remaining Alphabetic Keys, 901 , 902 , 903 , 904 , 905 , 906 , 907 , 908 , 909 , 910 , 911 , 912 , 913 , 914 , 915 and 916 provide only a single alphabetic character input.
- FIG. 10 shows a flow chart illustrating a process of automated dictionary population, shown generally at 1000 .
- the process begins and then progresses to step 1010 where the message is received.
- Messages may be received through the User 101 inputting a message on the Dictionary System 110 .
- the Dictionary System 110 may receive a message from the External Wireless Network 103 , such as an email or SMS.
- step 1020 the message is stored. While it is conceivable that the Automated Dictionary Population System 100 may process messages upon receipt, thereby eliminating the need to store the message, it may be desirous to store the message until the User 101 interacts with the message so that the User 101 may be queried for feedback when necessary.
- the Message Storage 113 may store the message.
- step 1030 the message is processed for dictionary population.
- the Processor 117 may perform the processing of the message. The details of message processing will be described in more detail below.
- step 1040 the words populating the dictionary may be recalled. This may occur during predictive word presentation as a candidate word to the User 101 during text input by the User 101 . Prediction of words may utilize the Predictor 519 . The process then ends.
- FIG. 11 shows a flow chart illustrating a process of message processing, shown generally at 1030 .
- the process begins from step 1020 of FIG. 10 .
- the process then progresses to step 1101 where words are extracted from the message. Extraction may be performed by the Word Extractor 311 .
- the process then progresses to step 1109 where the extracted words are compared against the words preexisting within the Dictionary Set 115 . This function may be performed by the Dictionary Comparer 313 .
- step 1104 slang and misspelling is resolved through comparison to the Frequently Misspelled List 219 .
- dictionary error distance may be calculated for words, and those which have low error distances may be used to estimate which candidates are most likely to have been intended. Although this may prove disruptive to a user in the early stages, a simple query may be presented that allows the removal of erroneously stored words. This may be resolved simply by marking the word or word group when they are retrieved as candidates.
- step 1105 an inquiry is made as to whether the word is found within the Dictionary Set 115 . If the word is not yet stored within one of the dictionaries of the Dictionary Set 115 , the process then progresses to step 1111 where the word is stored within the Supplemental Dictionary 215 by the Word Storage Moderator 319 . Then, at step 1113 , statistical analysis may be performed upon the newly stored word. Statistical analysis may utilize the Statistical Engine 317 , and may include frequency analysis, sender analysis and additional statistical measures. The process then concludes by progressing to step 1040 of FIG. 10 .
- step 1105 if at step 1105 , the word is found within the Dictionary Set 115 , the process then progresses to step 1107 where an inquiry is made as to whether the word is found within the Profanity List 217 . If the word is a profanity, the process then progresses to step 1109 where a profanity interruption process may be performed by the Profanity Interrupter 315 . The process then concludes by progressing to step 1040 of FIG. 10 .
- step 1107 the process then progresses to step 1113 where statistical analysis may be performed upon the previously stored word.
- Statistical analysis may utilize the Statistical Engine 317 , and may include frequency analysis, sender analysis and additional statistical measures. The process then concludes by progressing to step 1040 of FIG. 10 .
- FIG. 12 shows a flow chart illustrating a process of word extraction, shown generally at 1101 .
- the process begins from step 1020 of FIG. 10 .
- the process then progresses to step 1201 where the message is retrieved from storage within the Message Storage 113 .
- Retrieval may utilize the Retriever 411 .
- retrieval may be initiated when there is a triggering event, such as connection of the Dictionary System 110 to an external power source, or the opening of a message by the User 101 .
- step 1203 the message is parsed for individual words. Parsing may utilize the Message Parser 413 . The process then concludes by progressing to step 1103 of FIG. 11 .
- FIG. 13 shows a flow chart illustrating a process of profanity interruption, shown generally at 1109 .
- a common problem with personal communications is that the informality leads to the propagation of written messages whose content may be profane or laden with expletives. Seemingly sane users may cast caution to the wind, assuming that the message is private and will not be shared. The consequence of this invention is that obscenities may be gathered unwittingly and lead to embarrassment if someone other than the owner attempts to use the appliance.
- step 1301 The process begins from step 1107 of FIG. 11 .
- the process then progresses to step 1301 where some portion of the profanity is replaced by some place marker.
- some place marker In some embodiments, all but the first letter of the profanity may be replaced by asterisks. In some alternate embodiments, only vowels are replaced.
- Place markers may be any symbol desired, such as the pound symbol (#), an asterisk symbol (*), exclamation marks (! or any other desired symbol.
- This modified profanity may then be displayed to the User 101 at step 1303 .
- the intent in modifying the profanity in this way is to avoid offending the delicate User 101 .
- the User 101 may have intended to use the profanity, so it is equally important that this word selection be provided in a candidate word listing.
- the User 101 may rethink its usage, and avoid flippant use of words which may cause interpersonal or business relationship harm.
- the User 101 may be prompted for an action at step 1305 .
- step 1307 an inquiry is made as to whether the User 101 explicitly selects the modified profanity as the intended word. If the user makes such an explicit selection of the modified profanity, the word may be shown to the User 101 as an unmodified word at step 1309 .
- the profanity may also be added to the Used Word List 213 at step 1311 .
- the word may be linked to a particular recipient, so that in future uses of the word it will still be treated as a profanity in most scenarios, but be treated as a regular word when used with “familiar” or “informal” contacts.
- the process then concludes by progressing to step 1040 of FIG. 10 .
- the modified profanity may be removed from the candidate word listing. The process then concludes by progressing to step 1040 of FIG. 10 .
- FIG. 14 shows a flow chart illustrating a process of statistical analysis of words, shown generally at 1113 .
- the process begins from step 1107 or 1111 of FIG. 11 .
- the process then progresses to step 1401 where the Phrasing Analyzer 511 analyzes the message for word groups.
- step 1403 word use likelihood, including frequency and recency, may be indexed by the Word Frequency Tracker 515 .
- word use likelihood including frequency and recency
- the Automated Dictionary Population System 100 eliminates the need to repetitively store multiple copies of a particular word in the Dictionary Set 115 . Also, these indices may be of particular use in the generation of predictive candidate word lists.
- Frequency tracking may be a simple count of word use, or may, in some embodiments, involve more sophisticated tracking of word use by sentence structure, message content, proximate words, or intended recipient.
- word use likelihood is indexed by recipient.
- the verbiage utilized when speaking to one's lover, mother, friend or business associate may vary greatly.
- predictive candidate lists may be more finely tuned when writing a message to a known recipient.
- step 1407 language is analyzed for affect; that is, the emotional effect invoked by the message.
- Particular words or phrases may be identified which denotes particular emotional response.
- particular grammar may also denote mood of the message. For example, speech patterns directed to a teen friend, versus a parent or employer may be identified. Monitoring affection in the language may be particularly useful in generation of candidate word lists.
- the process then concludes by progressing to step 1040 of FIG. 10 .
- FIG. 15 shows a flow chart illustrating a process of analysis for word groups, shown generally at 1401 .
- the process begins from step 1107 or 1111 of FIG. 11 .
- the process then progresses to step 1501 where the Phrase Group Identifier 611 identifies phrase groups.
- phrase groups As noted earlier, when a group of adjacent words is parsed from the text message, if none are to be found in the main dictionary, they may be stored with additional information that allows them to remain linked. Two or more adjacent words may form a phrase or term of art or an associated word group. Words which precede or follow the group will generally be found in the main dictionary. Phrases may also be identified by explicit mean such as capitalization, quotation marks surrounding the phrase, by underlining or marking in a distinct way.
- step 1503 the Linker 613 links the words identified as a phrase.
- the Automated Dictionary Population System 100 minimizes the need to store each phrase separately. Instead, where the phrase includes words found in the Dictionary Set 115 , each of the already stored words may further include links to generate the phrase. This enables conservation of storage resources.
- the process then concludes by progressing to step 1403 of FIG. 14 .
- FIG. 16 shows a flow chart illustrating a process of identifying phrase groups, shown generally at 1501 .
- the process begins from step 1107 or 1111 of FIG. 11 .
- the process then progresses to step 1601 where adjacent words not found in the Dictionary Set 115 are identified as a potential phrase group. If additional indications of a phrase are present, such as capitalization, quotes or joining words, then the system may automatically save the word string as a phrase. If there are no other indications that the word string is a phrase, the system may query the User 101 to resolve the ambiguity.
- step 1603 capitalized phrases are identified.
- step 1605 quoted phrases are identified; and at step 1607 italicized phrases are identified.
- step 1609 semantic rules may be utilized to determine phrases.
- semantic analysis may include identifying “joining words” and particular rhyme or cadence associated with phrases. Often the User 101 may be queried to resolve ambiguities on whether a particular set of words includes a phrase.
- step 1611 common abbreviations and acronyms which designate phrases are identified. As noted, these are common in business settings; however, such “shorthand” is likewise becoming increasingly common during casual messaging with terms such as “lol”, “bff” and “cul8tr”. The process then concludes by progressing to step 1403 of FIG. 14 .
- the present invention relates generally to automated dictionary generation system and method to provide fast, accurate and resource efficient population of personalized dictionaries. Additionally, this rapid dictionary population enhances early use of a mobile device, provide comprehensive profanity protection and aids in rapid text input on a mobile device. In this way the automated dictionary generation system and method may provide an invaluable tool for device manufacturers and device users.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
- This application is a continuation of U.S. patent application Ser. No. 13/772,139, filed Feb. 20, 2013, which is a continuation of U.S. patent application Ser. No. 13/434,730, filed Mar. 29, 2012, now U.S. Pat. No. 8,386,241, issued Feb. 26, 2013, and which is a continuation of U.S. patent application Ser. No. 12/135,142, filed Jun. 6, 2008, now U.S. Pat. No. 8,180,630, issued May 15, 2012, each of which is incorporated herein in its entirety by this reference thereto.
- This invention relates generally to generation of a personalized dictionary for portable devices. More particularly, the present invention relates to a method for populating a personalized dictionary in a semi automated fashion. This is achieved through the analysis of communication messages written, spoken, sent or received on the portable device. Text may include any written characters, or transcriptions of verbal messages. Such text or verbal message may include text using Roman based alphabets, Chinese alphabet, Arabic scripts, or virtually any known language's symbology.
- In today's increasingly mobile population, the ability to input text into a mobile device is becoming more desirable. Emails, appointments and text messages are routinely inputted into mobile devices, including Personal Digital Assistants (PDA's), cell phones and computerized organizers.
- For the business person, the ability to send emails and document appointments, while on the go, enables a jumpstart into the workday, increased productivity and enhanced flexibility. For the teenage, or other casual user, text messaging has become an exceedingly common phenomena and a form of social currency.
- Mobile devices typically have less processing power and storage resources available than a stationary computer system. Additionally, due to the small size of these personal appliances, keypads are typically very small or require multiple keytaps. This small, highly portable size of the devices that enable mobile text connectivity also render the input of such text onerous.
- In response, typical personal portable appliances may include utilities that facilitate the generation or entry of textual material for messaging purposes. In general, these utilities may be one of several types, or some combination, including: i) systems which allow a user to enter text letter by letter using a scheme where a letter on a key is specifically identified in a deterministic fashion commonly called multi-tap systems, and ii) systems which match sequences of keys to word possibilities either algorithmically or by matching pre-stored dictionary entries, and iii) fully deterministic systems having a one to one correspondence to desired symbols such as a full keyboard, albeit miniaturized. These latter systems, of course, allow complete flexibility of symbol string entry.
- In all of these systems, considerable benefit may be realized by providing the user with candidate words for selection by the user prior to completion. Particularly for long words, this predictive presentation of candidate words may save the user considerable typing time and keystrokes. Ordered dictionaries may be used to supply candidates and, given a well populated dictionary, results can be very good for many applications.
- As noted, result quality is a strong function of the dictionary ordering strategy, so considerable effort is required to tune system performance so that the user experience is satisfactory. Poor candidates are a distraction rather than a benefit for the user, thus well populated dictionaries are a virtual necessity.
- However, due to storage limitations in these portable devices, the dictionaries relied upon are necessarily not exhaustive word lists. Additionally, even were one able to have an exhaustive dictionary, querying such a database would be impractical for real time word candidate prediction, particularly for personal devices with limited processing ability.
- As such, in typical systems, there are three essential components to the dictionary. The first is a static dictionary which is formulated from a substantial corpus in the target language. Such static dictionaries may additionally be referred to as a static element, base dictionary, first dictionary or static word list. In the initial use of the appliance, the performance of the utility is dominated by this static element. Although such a static dictionary may be changed in some modern appliances, such static dictionaries are, at best, quasi-static since changing content may confuse or distract the user and may confound manufacturer support activities.
- The second dictionary component is a used word listing that may have an associated ordering algorithm. Such a used word list may additionally be referred to as a used word dictionary, usage dictionary, second dictionary or common word list. Whenever a user creates a message, words used in message creation are added to a dictionary that stores used words. This used word dictionary is helpful in that words and text constructs peculiar to that user are saved. Since a user tends, by and large, to use words and structures that have become habitual, and thus personal to the user, intended words may be predicted based upon the usage patterns established. This is believed to speed system response, generally, since users tend to re-use certain words and it is far better to keep a separate entry list than to attempt to manage the full dictionary; again system support is eased if the primary dictionary is kept fairly static.
- A third list may be present that allows a user to create words that may be absent from the primary dictionary. Such a third list may additionally be referred to as a supplemental element, supplemental dictionary, third dictionary or supplemental word list. The supplemental dictionary allows preservation of the root dictionary whilst permitting a personal list of items, such as proper names or terms of art, relevant to a particular user to be stored.
- Currently the population of the used word list and supplemental dictionary may require the user to input many words in full. That is, the user may be required to type in an entire word, often requiring the user to switch input modes to a deterministic input. Switching input modes may inconvenience the user, slow down messaging, and generally reduce efficiency and usability of the portable device. This inconvenience additionally occurs at a time when the dictionaries are sparsely populated, thus rendering generation of predictive candidates words limited, or worse, erroneous.
- Another current method of addressing such an issue is to attempt to preload dictionary sets so that the user has fewer words to manually input. This has been met with mixed success, since such predetermined lists are very costly and difficult to compile, and are often non-reflective of what terms and words the user desires to use.
- Thus, in the typical mobile device, the current lack of rapid dictionary population may be inadequate as requiring too much manual attention from the users, or requiring too much storage for exhaustive dictionary sets. Manufacturers and retailers of mobile devices would benefit greatly from the ability to offer devices with accurate and rapid dictionary word population. Additionally, users of these mobile devices would benefit greatly by having reduced aggravation and more efficiency when initially inputting text on the mobile device.
- The current invention aids in automating, at least in part, the creation of the supplemental dictionary. A considerable benefit is that caller name records may be built rapidly as may be terms of art, thus freeing the user from the laborious task of creating each entry one by one.
- It is therefore apparent that an urgent need exists for an improved system and method for automated dictionary population that is both accurate and efficient. This solution would replace current practices of making the user deterministically input each unknown word with a more efficient and rapid system with regards to mobile devices; thereby increasing effectiveness and general usability of text input performed on a mobile device.
- To achieve the foregoing and in accordance with the present invention, a method and system for automated dictionary population is provided. Such a system is useful for a user of mobile devices to efficiently produce text data yet avoid much of the laborious task of deterministically entering every new word for storage and future use.
- The mobile device, or personal appliance, may include at least one dictionary which includes entries. Every time the device receives a communication intended for the user, the information may be parsed and textual data extracted. The text is then compared to the entries of the dictionaries to identify new words. Statistical information may be generated for the parsed words. This information includes word usage frequency, recency, or likelihood of use.
- Profanities may be processed by identifying profanities within the parsed words by comparing the parsed words to a profanity word list, modifying the profanities by replacing at least some of the profanity with a place marker and displaying the modified profanity to a user in a candidate list. Then the user may be asked to provide feedback either selecting or deselecting the profanity. Selecting the profanity results in displaying the profanity to the user and storing the profanity. De-selecting the profanity removes the profanity from the candidate list.
- Phrases from the parsed words may be identified by phrase markers, which may include at least one of italicized word groups, quoted word groups, bolded word groups, capitalized word groups, word groups containing more than one new word, and groups of words including joining words.
- Lastly, the new words may be stored in a supplementary dictionary or word list. These words may be stored as single words or may be stored utilizing linking the words of the identified phrases to preserve any phrase relationships. This is valuable in the case of certain professions where a phrase may be a term of art and the individual words may be less useful when used alone. Likewise, the statistical information may be stored.
- By using communicated data in this way, pertinent material may be gleaned without deliberate user activity. This results in a rapid accumulation of words and terms beyond those found in the static dictionary or word list, which words are personal to that user by virtue of having been used in exchanges. Names may also be marked as special and related to other directories.
- Voice data may also be processed and harvested for word samples in the same way. In at least one application, voice messages may be machine converted to textual form external to the mobile appliance and submitted to the appliance using the GSM short message service or similar service.
- When coupled with word prediction or completion methods, the stored data harvested in these ways allows relevant candidates to be shown to the user more frequently than those extracted from a static dictionary constructed from corpora having a broader or more general statistical bias. Moreover, by combining words that are related to form phrases, it has been found that a substantial improvement in the candidate quality and a reduction in required keystrokes is usual.
- These and other features of the present invention may be practiced alone or in any reasonable combination and will be discussed in more detail below in the detailed description of the invention and in conjunction with the following figures.
- In order that the present invention may be more clearly ascertained, one embodiment will now be described, by way of example, with reference to the accompanying drawings, in which:
-
FIG. 1 shows a logical block diagram of an automated dictionary population system in accordance with an embodiment of the present invention; -
FIG. 2 shows a logical block diagram of a dictionary set for the automated dictionary population system ofFIG. 1 ; -
FIG. 3 shows a logical block diagram of a processor for the automated dictionary population system ofFIG. 1 ; -
FIG. 4 shows a logical block diagram of a word extractor for the automated dictionary population system ofFIG. 1 ; -
FIG. 5 shows a logical block diagram of a statistical for the automated dictionary population system ofFIG. 1 ; -
FIG. 6 shows a logical block diagram of a phrasing analyzer for the automated dictionary population system ofFIG. 1 ; -
FIG. 7 shows an illustration of a mobile device in conjunction with a communication network in accordance with an embodiment of the present invention; -
FIG. 8 shows an illustration of an ambiguous style keypad associated with the mobile device in accordance with an embodiment of the present invention; -
FIG. 9 shows an illustration of a deterministic style keypad associated with the mobile device in accordance with an embodiment of the present invention; -
FIG. 10 shows a flow chart illustrating a process of automated dictionary population in accordance with an embodiment of the present invention; -
FIG. 11 shows a flow chart illustrating a process of message processing in accordance with an embodiment of the present invention; -
FIG. 12 shows a flow chart illustrating a process of word extraction in accordance with an embodiment of the present invention; -
FIG. 13 shows a flow chart illustrating a process of profanity interruption in accordance with an embodiment of the present invention; -
FIG. 14 shows a flow chart illustrating a process of statistical analysis of words in accordance with an embodiment of the present invention; -
FIG. 15 shows a flow chart illustrating a process of analysis for word groups in accordance with an embodiment of the present invention; and -
FIG. 16 shows a flow chart illustrating a process of identifying phrase groups in accordance with an embodiment of the present invention. - Introduction
- The present invention will now be described in detail with reference to several embodiments thereof as illustrated in the accompanying drawings. In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art, that the present invention may be practiced without some or all of these specific details. In other instances, well known process steps and/or structures have not been described in detail in order to not unnecessarily obscure the present invention. The features and advantages of the present invention may be better understood with reference to the drawings and discussions that follow.
- The present invention relates generally to semi automated dictionary population system and method to provide fast and efficient dictionary generation and personalization for mobile devices (also known as a personal appliance). More particularly, the present invention relates to a method for dictionary population that requires fewer storage resources and less distracting inputs from the user.
- In current systems, each time a user wishes to use a word that is not a part of the root dictionary the new word must be created and stored. Generally when a non-deterministic keyboard is used, a user must interrupt the task at hand and enter the new word in some deterministic fashion. In a typical appliance such as a cellular telephone, it may mean that a user will have to change entry modes to use a multi-tap scheme to create this new word. By using alternate sources of information to supplement a user's dictionary, such as that which is disclosed by the present invention, a significant improvement may be realized over the old systems.
- II. Automated Dictionary Population System
- To facilitate discussion,
FIG. 1 shows a logical block diagram of an AutomatedDictionary Population System 100. The AutomatedDictionary Population System 100 may include aUser 101 which interacts with aDictionary System 110. Additionally, theDictionary System 110 may, in some embodiments, interface anExternal Wireless Network 103. TheDictionary System 110 may, in some embodiments, provide population of dictionaries. - The
Dictionary System 110 may include anInterface 111, aMessage Storage 113, aDictionary Set 115, aProcessor 117 and aWireless Connector 121. TheInterface 111 may enable theUser 101 to interact with theDictionary System 110. Likewise, theWireless Connector 121 may enable theDictionary System 110 to access theExternal Wireless Network 103. - The
External Wireless Network 103 may include a Wide Area Network (WAN) such as the internet, a cellular phone network, another device such as one's personal computer, or any desired data source. Typically, in some embodiments, theExternal Wireless Network 103 may enable the transfer of text data from theDictionary System 110 to other devices for delivery to the intended recipients. -
Dictionary System 110 may be contained within a mobile device such as a Personal Digital Assistant (PDA), cellular phone, computerized organizer, personal computer, Blackberry or similar device, as is well known by those skilled in the art. While the disclosed invention is, in some embodiments, shown for use by mobile devices, the present invention is not intended to be limited to devices that are mobile. For example, in some embodiments, the present invention may be utilized upon a standard desktop computer, cash register, land line telephone, or any text capable device. - Additionally, in some embodiments, the
User 101 is not required for the AutomatedDictionary Population System 100. For example, if theDictionary System 110 receives text data from theExternal Wireless Network 103, theDictionary System 110 may perform dictionary population without receiving input from theUser 101. -
Interface 111 may be a keypad, touch screen, stylus pad, or any input device. Additionally, in some embodiments,Interface 111 may also provide an output such as a screen or sound output. Alternate systems of input and output may be utilized by theInterface 111 as is well known by those skilled in the art. TheInterface 111 facilitates input from theUser 101 to theProcessor 117. - Messages provided by the
User 101 through theInterface 111 may be stored by theMessage Storage 113. Also, messages received by the mobile device from theExternal Wireless Network 103 via theWireless Connector 121 may, likewise, be stored by theMessage Storage 113. TheMessage Storage 113 may additionally be referred to as an ‘inbox’ or similar term. TheMessage Storage 113 is of finite size, although that size may be very large in a modern mobile device. Messages may be deleted when theUser 101 has no further need of them or may be deleted automatically when a time limit is reached. Regardless of the actual mechanism,Message Storage 113 contents may be regarded as temporary in nature. - It may be possible, in some embodiments, to perform dictionary population upon receipt of the message and thereby minimize or eliminate the need for the
Message Storage 113. However, in some alternate embodiments, particularly when the message is one received from theExternal Wireless Network 103, such as an email, it may be desirous to delay database population until theUser 101 reads the message and is available to provide feedback if necessary. An additional benefit is realized by retaining the message received in that a response to any particular email, for example, may be biased toward the language and word use in that received message. - Much of the discussion contained herein will refer to text as words containing letters from the Roman alphabet. The discussion and examples utilizing Roman alphabet letters is purely exemplary in nature. The present invention is intended to also extend to alternate languages where symbols, glyphs or characters are strung together to produce text. For example, in Chinese a particular string of traditional ideographic symbols, known as the Zhuyin or BoPoMoFo alphabet, may be compiled as to create a character. In Japanese, beyond the ideographic Kanji characters lie a pair of syllabaries called the Kana, and these too are covered by the present invention. Likewise, the present invention may extend to standard Romanization systems, such as Pinyin for Mandarin. It will be seen that the exemplified system and method for dictionary generation is versatile enough to apply not only to Roman alphabets, but any language's symbology.
- Likewise, much of the present discussion contained herein will refer to messages as written text. The discussion and examples utilizing written text is purely exemplary in nature. The present invention is intended to also extend to any communication medium including voice, embedded audio in video feeds, email and text messages. For example, increasingly when a user is unavailable to take a voice call, instead of simply recording the caller's message, services are now provided whereby the recorded voice may be rendered as a short text message and relayed to the recipient. Such commercial services are offered by SpinVox and described in their corporate description. This has the considerable benefit to the user in that relevant information may be quickly available without the attendant interruption of the voice call. This invention may monitor the short text message storage such as the ‘inbox’ and after extracting words that are not already found in the dictionary structure may add them to the dictionary structure.
- The
Dictionary Set 115 may include the static root, or first, dictionary as well as user populated dictionaries, including the supplemental word list, i.e. the dictionaries being populated by the present invention. The supplementary word lists may be stored as a single list which may be considered to be a ‘used word’ list. Otherwise, these supplementary word lists may be stored as one or more separate word lists, each having a reference entry that allows access to these particular lists only during text exchanges which use at least some of the terms or words stored therein. Thus, for example, a message sent to John Smith could search not only the main dictionary and the personal word list, but also a used word list and a list of words used in messages received from John Smith. In some implementations, a list of all received words is kept and is accessible from any application where text entry is used. Details of the architecture of theDictionary Set 115 will be provided below. - Duplication of words is wasteful; storing the same word more than once outside the main dictionary is not necessary. However, by storing a duplicate word or reference to a word outside the main dictionary, an advantage is that the word is accessed earlier because it has become more frequently used than might be implied from the main dictionary. It is thus beneficial to store pointers to words in order to control memory usage, and also allowing phrases to be constructed by directing to particular words regardless of their actual location.
- The
Processor 117 may perform the analysis and computations required to populate theDictionary Set 115. Upon initial startup, theProcessor 117 may sequentially read each message and extract every word contained in these messages. This extracted word list is then stored as a supplementary dictionary list in theDictionary Set 115. Each time thereafter, when a message is received, the text from that received message is extracted and parsed and the words are added to this dictionary. In some cases, words will be repeats of those already stored in the main dictionary. Details of the architecture of theProcessor 117 will be provided below. - In the GSM short message service (SMS) there are several methods of handling received messages. Normal messages which contain displayable text may be presented for the
User 101 on command, and read in the normal fashion. Other messages may be sent which contain machine level instructions for the receiving device and allowUser 101 action to cause certain transactions that are not normal messaging transactions. This invention is mainly concerned with readable messages intended for theUser 101. It is also the case that electronic mail has the same essential characteristics; and, in fact, any messaging application can be treated in the same way by the AutomatedDictionary Population System 100. - When a message is opened to be read by the
User 101, the message may be parsed and a temporary list of words may be created. Each word is tested to see if it is already stored in the used word dictionary. Since there is no need to duplicate the word if it has already been stored in the used word dictionary, such repeat words may be discarded. If a word is not found in the used word dictionary, it may be appended to the list so that the list extends downwards with the last entries at the end. This feature may be beneficially used to search recent entries. - In some embodiments, the Automated
Dictionary Population System 100 may be enabled to group phrases so that components of terms of art may be stored. Especially, medical terms and legal terms routinely use word groups; as an example, consider terms such as res ipsa loquitur and mutatis mutandis where neither term is best stored as separate parts. Although each term may be fabricated from a string of single words, it is advantageous if the words that make up the terms are linked. Medical terms are notoriously lengthy and similarity between words may convey entirely the wrong information. In this case, linkage between words may be even more beneficial. -
FIG. 2 shows a logical block diagram of theDictionary Set 115 for the AutomatedDictionary Population System 100 ofFIG. 1 . TheDictionary Organizer 201 may provide organization for theDictionary Set 115 as well as coupling theDictionary Set 115 to the other components of theDictionary System 110, as illustrated by aCloud 200. TheDictionary Set 115 may also include aStatic Dictionary 211, aUsed Word List 213, aSupplemental Dictionary 215, and aProfanity List 217. In some embodiments, more or fewer dictionary partitions may be included within theDictionary Set 115. Likewise, each dictionary within theDictionary Set 115 may be further subdivided into sub-dictionary lists. For example, as previously noted, theSupplemental Dictionary 215 may be divided into multiple supplemental word lists, accessible only when addressing a particular recipient or when discussing terms found in such a list. - The
Static Dictionary 211 may be referred to as the first dictionary, root dictionary, original dictionary, or base word list. The content ofStatic Dictionary 211 is typically preloaded by the manufacturer of the mobile device. Also, theStatic Dictionary 211 is typically not amendable by theUser 101. TheStatic Dictionary 211 may be formulated from a substantial corpus in the target language, and may contain any number of words, dependent upon manufacturer desires and availability of storage resources. However, in many current mobile devices, theStatic Dictionary 211 may include a corpus of approximately 10,000 to 100,000 words on average. - The
Used Word List 213 may be populated by words that have been used by theUser 101 or received by theDictionary System 110 via theExternal Wireless Network 103. TheUsed Word List 213 may then be appended as additional words are received. TheUsed Word List 213 may have an associated ordering algorithm. In some embodiments, words are not duplicated within theUsed Word List 213 andStatic Dictionary 211. Instead, a reference is placed within theUsed Word List 213 to the word found in theStatic Dictionary 211. Likewise, multiple usages of particular words will not result in duplication within theUsed Word List 213, but rather, each word within theUsed Word List 213 may include a counter to track frequency of use. Such usage tracking may be utilized to provide predictions of words to theUser 101 during message composition. Frequency and Recency are the two elements that may be used to force an order to the assorted lists. These two elements are both embodied in the concept of ‘likelihood’. Usage frequency need not be any absolute numerical value. In some embodiments, it suffices to store data representative of relative frequency. In the minimal form, list ordering may be used to imply relative frequency. Moreover, since recency is also a valuable index of likelihood, this too may be used as a parameter. - The
Supplemental Dictionary 215, as used in this application, may be a particular type of used word list. As such, in some embodiments, theUsed Word List 213 andSupplemental Dictionary 215 may, in fact, be one and the same. However, due to the particular structure desired for theSupplemental Dictionary 215, in some embodiments, it has been distinguished as a separate component of theDictionary Set 115. For example, it may be beneficial to separate the organization of certain words based on the symbol set or font detail. TheSupplemental Dictionary 215 enables preservation of theStatic Dictionary 211 whilst permitting a personal list of items such as proper names or terms of art relevant to aparticular User 101 to be stored. The method of dictionary population disclosed by this invention involves the generation and promulgation of theSupplemental Dictionary 215. - The
Supplemental Dictionary 215, as noted, may be stored as a single list. Otherwise, theSupplemental Dictionary 215 may be stored as one or more separate word lists each having a reference entry that allows certain ones of these lists to be accessed only with text exchanges which use these terms or words. - The
Dictionary Set 115 may also include aProfanity List 217. TheProfanity List 217 enables profanities and expletives to be identified. Profanities may be determined by community, or target consumer standards. Profanities may include words and phrases native to the user's language, as well as commonly used slang or foreign profanities. In some embodiments, context of the word may likewise be analyzed to determine if its usage is deemed profane. The AutomatedDictionary Population System 100 may then resolve the use of the profanity whereby theUser 101 is not overly inconvenienced, or offended. - The
Dictionary Set 115 may also include a Frequently MisspelledWord List 219. The Frequently MisspelledWord List 219 enables identification of misspelled words so that these words are not used to populate the dictionary. Although, not addressed specifically by this invention, the difficulty caused by improper spelling may be resolved through the use of the Frequently MisspelledWord List 219 and dictionary error distance calculations. Error distance may be calculated for words, and those which have low error distances may be used to estimate which candidates are most likely to have been intended. Although this may prove disruptive to a user in the early stages, a simple query may be presented that allows the removal of erroneously stored words. This may be resolved simply by marking the word or word group when they are retrieved as candidates. For example the misspelled word recieve would appear italicized or otherwise distinguished in addition to the correctly spelled word. Selection of a seemingly misspelled word would confirm its probable valid status and promote its likelihood of retrieval whereas non-selection would demote it. Automatic removal is possible but must be approached with great care. Capitalized words, in some embodiments, should not be routinely eliminated. -
FIG. 3 shows a logical block diagram of theProcessor 117 for the AutomatedDictionary Population System 100 ofFIG. 1 . TheCoupler 301 may couple theProcessor 117 to the other components of theDictionary System 110, as illustrated by theCloud 200. TheProcessor 117 may additionally include aWord Extractor 311, aDictionary Comparer 313, aProfanity Interrupter 315, aStatistical Engine 317 and a Word Storage Moderator 319. - The
Word Extractor 311 parses the messages and extracts words, where theDictionary Comparer 313 then compares the extracted words to those already stored within theDictionary Set 115. If a profanity is identified, theProfanity Interrupter 315 may perform an interruption to resolve the profanity. - The
Statistical Engine 317 may provide word prediction during text entry, as well as the ability to determine phrases through identification of joined words. - The Word Storage Moderator 319 may direct the storage of new words within the
Dictionary Set 115. -
FIG. 4 shows a logical block diagram of theWord Extractor 311 for theProcessor 117 ofFIG. 3 . TheWord Extractor 311 may include aRetriever 411 and aMessage Parser 413 coupled to one another. Likewise, theRetriever 411 andMessage Parser 413 may couple to the other components of theProcessor 117, as illustrated by theCloud 400. - The
Retriever 411 may retrieve messages from theMessage Storage 113 for analysis. Retrieval may be automated by a trigger, or by timing. For example, retrieval may occur when theUser 101 opens a message for viewing. In this way theDictionary System 110 may gain feedback from theUser 101 in instances where clarification is desired. In some embodiments, message processing may be deferred when available power is below a certain threshold and a large amount of data may be present for processing. In such an instance, a particular message may be saved for later ifUser 101 feedback is desired. Dispute resolution may then be achieved through user intervention. - The
Message Parser 413 may parse the message into individual words for the extraction. In some embodiments, theMessage Parser 413 may also be configured to identify indicators of a phrase. In these embodiments, theMessage Parser 413 may parse the individual words of the suspected phrase, as well as parse the entire intact phrase for analysis by theStatistical Engine 317. -
FIG. 5 shows a logical block diagram of theStatistical Engine 317 for theProcessor 117 ofFIG. 3 . TheStatistical Engine Coupler 501 may couple theStatistical Engine 317 to the other components of theProcessor 117, as illustrated by theCloud 400. TheStatistical Engine 317 may additionally include aPhrasing Analyzer 511, aReferencer 513, aWord Frequency Tracker 515, aRecipient Analyzer 517 and aPredictor 519 each coupled to one another. TheWord Frequency Tracker 515 may include tracking word use frequency and word recency. - The
Phrasing Analyzer 511 may take the parsed language generated by theMessage Parser 413 and identify the phrases. ThePhrasing Analyzer 511 may also link particular words for later predictive processes. - The
Predictor 519 predicts words for the creation of candidate word lists. In some embodiments, thePredictor 519 may use fuzzy logic in order to select the candidate word lists. Fuzzy logic is derived from fuzzy set theory dealing with reasoning that is approximate rather than precisely deduced from classical predicate logic. It can be thought of as the application side of fuzzy set theory dealing with well thought out real world expert values for a complex problem. - The
Referencer 513 may reference words already located within theDictionary Set 115, thereby eliminating the need for duplicate storage of words. Likewise, theWord Frequency Tracker 515 may keep track of the frequency of word usage. Again, by tracking frequency, multiple uses of a single word will still result in a single word entry within theDictionary Set 115, thus saving storage resources. Also, the frequency of word use may be utilized by thePredictor 519 to generate candidate lists for theUser 101 during text entry. TheWord Frequency Tracker 515 may compile simple indicia of a word's gross usage and recency. By appending to a list, recency steps occur naturally, since when indexed from the end, backwards, the most recent words are identified. If a word occurs duplicatively, earlier in the process (closer to the front) this word can be de-referenced and, at a convenient time, the list may be shuffled or compacted to eliminate the earlier instance of a recent word. If needed, the list may be augmented by keeping a note of how often the word has been used. - In some embodiments, the
Word Frequency Tracker 515 may provide a more detailed and useful analysis of frequency. For example, more advanced versions of theWord Frequency Tracker 515 may provide word frequency use when the message is directed toward a particular recipient. Likewise, in some embodiments, theWord Frequency Tracker 515 may generate multiple frequency indicia for a word, dependent upon the preceding word(s), general message content, sentence grammar or other variable. In this way, theWord Frequency Tracker 515 may generate a rich set of frequency statistics for a more refined, and ultimately more useful, word prediction by thePredictor 519. Complexity of theWord Frequency Tracker 515 may depend upon manufacturer's desires, and may consider storage and computation resources available to theDictionary System 110. - The
Recipient Analyzer 517 may analyze message recipient to generate data regarding word usage frequency by, or to, each recipient, and to also aid in the generation of recipient specific supplemental word lists of theSupplemental Dictionary 215. -
FIG. 6 shows a logical block diagram ofPhrasing Analyzer 511 for theStatistical Engine 317 ofFIG. 5 . ThePhrasing Analyzer 511 may include aPhrase Group Identifier 611 and aLinker 613 coupled to one another. Likewise, thePhrase Group Identifier 611 andLinker 613 may couple to the other components of theStatistical Engine 317, as illustrated by theCloud 600. - The
Phrase Group Identifier 611 may identify words strings which form phrases. TheLinker 613 may provide links for the words of the phrase so that the actual phrase need not be stored in its entirety. - When a group of adjacent words is parsed from the text message, if none are to be found in the main dictionary, they may be stored with additional information that allows them to remain linked. Two or more adjacent words may form a phrase or term of art or an associated word group. Words which precede or follow the group will generally be found in the main dictionary. Phrases may also be identified by explicit mean such as capitalization, quotation marks surrounding the phrase, by underlining or marking in a distinct way. This latter is common in Chinese; for example, where characters that are intended to be read as a single phrase, such as a name, may be underlined and thus conjoined. In alphabetic based languages, it is common to find joining words such as of or in, used along with capitalized words. For example, the
Phrasing Analyzer 511 may receive Cost of Goods or Moreton-in-Marsh from theMessage Parser 413, and thePhrasing Analyzer 511 may be configured to identify these word groups as related or associated word structures. These associations between words may be stored in a way that enables them to be easily recalled by theUser 101. - Moreover, in certain dialects, the word “and” is a strong joining word feature. For example, the Cockney dialect of English has a strong “rhyming slang” format; “apples and pears” is used to substitute for “stairs” whereas “trouble and strife” would be used to mean “wife”. By using semantic rules, it may be possible to detect relationships of this nature between words in a message. In some embodiments, any capitalized words separated by known “joining” words may be treated as a group.
- Yet another form of entry, acronym and abbreviation, is identifiable by such word association. Common business terms are frequently referenced in acronymic form where the full name is several words long; thus COGS could be entered, and the phrase Cost of Goods returned. Another common example would be FAQS for Frequently asked Questions. The use of the “S” at the end of an acronym is often either redundant, being used as an aid to pronunciation, or used to denote a plural form and is a known case where it may be safely discarded in the matching process since if a full match is possible, it will occur in any case.
-
FIG. 7 shows an illustration of a user interaction with a wireless mobile device, shown generally at 700. In this exemplary illustration, theUser 101 is seen interacting with aDictionary System 110, which is, in this exemplary illustration, a mobile device. TheDictionary System 110, as embodied in the mobile device, includes aDisplay 713,Keypad 715 andMicrophone 717, which collectively comprise theInterface 111 of theDictionary System 110. TheKeypad 715 in the exemplary illustration may include a non-deterministic, or ambiguous, keypad, or a deterministic style keypad. TheDictionary System 110 may be coupled, wirelessly, to theExternal Wireless Network 103 via aWireless Receiver 705. In some embodiments, theWireless Receiver 705 may include a Bluetooth adapter, radio tower, access point, or any other wireless signal intermediary. - It should be noted that the
Dictionary System 110 may rely upon a wired connection to couple to theExternal Wireless Network 103. The intent of these exemplary illustrations, as seen inFIG. 7 , is to show an exemplary variety of device configurations that the AutomatedDictionary Population System 100 is designed for. -
FIG. 8 shows an illustration of anAmbiguous Style Keypad 800 associated with many mobile devices. Such aKeypad 800 may be often found upon phones and other devices with limited key space. In anambiguous Keypad 800 eachNumerical Key Numeral Letters Letters non-numeric Keys - The
Ambiguous Keypad 800 may rely upon the number of times anyparticular Numerical Key Ambiguous Keypad 800. -
FIG. 9 shows an illustration of theDeterministic Keypad 715, or “full” keyboard, wherein the numerical inputs share a physical key with alphabetical inputs. TheDeterministic Keypad 715 has one symbol per letter in the Latin set, and 12 keys are labeled with thenumbers 0 through 9 and the characters * and # to correspond with the normal touch tone keys. - In this exemplary
Deterministic Keypad 715,Dualistic Keys - III. Methods of Dictionary Population
-
FIG. 10 shows a flow chart illustrating a process of automated dictionary population, shown generally at 1000. The process begins and then progresses to step 1010 where the message is received. Messages may be received through theUser 101 inputting a message on theDictionary System 110. Also, theDictionary System 110 may receive a message from theExternal Wireless Network 103, such as an email or SMS. - The process then progresses to step 1020 where the message is stored. While it is conceivable that the Automated
Dictionary Population System 100 may process messages upon receipt, thereby eliminating the need to store the message, it may be desirous to store the message until theUser 101 interacts with the message so that theUser 101 may be queried for feedback when necessary. TheMessage Storage 113 may store the message. - The process then progresses to step 1030 where the message is processed for dictionary population. The
Processor 117 may perform the processing of the message. The details of message processing will be described in more detail below. Then, atstep 1040, the words populating the dictionary may be recalled. This may occur during predictive word presentation as a candidate word to theUser 101 during text input by theUser 101. Prediction of words may utilize thePredictor 519. The process then ends. -
FIG. 11 shows a flow chart illustrating a process of message processing, shown generally at 1030. The process begins fromstep 1020 ofFIG. 10 . The process then progresses to step 1101 where words are extracted from the message. Extraction may be performed by theWord Extractor 311. The process then progresses to step 1109 where the extracted words are compared against the words preexisting within theDictionary Set 115. This function may be performed by theDictionary Comparer 313. - The process then progresses to step 1104 where slang and misspelling is resolved through comparison to the Frequently Misspelled
List 219. Additionally, dictionary error distance may be calculated for words, and those which have low error distances may be used to estimate which candidates are most likely to have been intended. Although this may prove disruptive to a user in the early stages, a simple query may be presented that allows the removal of erroneously stored words. This may be resolved simply by marking the word or word group when they are retrieved as candidates. - The process then progresses to step 1105 where an inquiry is made as to whether the word is found within the
Dictionary Set 115. If the word is not yet stored within one of the dictionaries of theDictionary Set 115, the process then progresses to step 1111 where the word is stored within theSupplemental Dictionary 215 by the Word Storage Moderator 319. Then, atstep 1113, statistical analysis may be performed upon the newly stored word. Statistical analysis may utilize theStatistical Engine 317, and may include frequency analysis, sender analysis and additional statistical measures. The process then concludes by progressing to step 1040 ofFIG. 10 . - Else, if at
step 1105, the word is found within theDictionary Set 115, the process then progresses to step 1107 where an inquiry is made as to whether the word is found within theProfanity List 217. If the word is a profanity, the process then progresses to step 1109 where a profanity interruption process may be performed by theProfanity Interrupter 315. The process then concludes by progressing to step 1040 ofFIG. 10 . - Otherwise, if at
step 1107 the word does not match an entry of theProfanity List 217, the process then progresses to step 1113 where statistical analysis may be performed upon the previously stored word. Statistical analysis may utilize theStatistical Engine 317, and may include frequency analysis, sender analysis and additional statistical measures. The process then concludes by progressing to step 1040 ofFIG. 10 . -
FIG. 12 shows a flow chart illustrating a process of word extraction, shown generally at 1101. The process begins fromstep 1020 ofFIG. 10 . The process then progresses to step 1201 where the message is retrieved from storage within theMessage Storage 113. Retrieval may utilize theRetriever 411. In some embodiments, retrieval may be initiated when there is a triggering event, such as connection of theDictionary System 110 to an external power source, or the opening of a message by theUser 101. - The process then progresses to step 1203 where the message is parsed for individual words. Parsing may utilize the
Message Parser 413. The process then concludes by progressing to step 1103 ofFIG. 11 . -
FIG. 13 shows a flow chart illustrating a process of profanity interruption, shown generally at 1109. A common problem with personal communications is that the informality leads to the propagation of written messages whose content may be profane or laden with expletives. Seemingly sane users may cast caution to the wind, assuming that the message is private and will not be shared. The consequence of this invention is that obscenities may be gathered unwittingly and lead to embarrassment if someone other than the owner attempts to use the appliance. - The process begins from
step 1107 ofFIG. 11 . The process then progresses to step 1301 where some portion of the profanity is replaced by some place marker. In some embodiments, all but the first letter of the profanity may be replaced by asterisks. In some alternate embodiments, only vowels are replaced. Place markers may be any symbol desired, such as the pound symbol (#), an asterisk symbol (*), exclamation marks (!) or any other desired symbol. This modified profanity may then be displayed to theUser 101 atstep 1303. - The intent in modifying the profanity in this way is to avoid offending the
delicate User 101. However, theUser 101 may have intended to use the profanity, so it is equally important that this word selection be provided in a candidate word listing. Moreover, by modifying the profanity, theUser 101 may rethink its usage, and avoid flippant use of words which may cause interpersonal or business relationship harm. TheUser 101 may be prompted for an action atstep 1305. - The process then progresses to step 1307 where an inquiry is made as to whether the
User 101 explicitly selects the modified profanity as the intended word. If the user makes such an explicit selection of the modified profanity, the word may be shown to theUser 101 as an unmodified word atstep 1309. The profanity may also be added to theUsed Word List 213 atstep 1311. In some embodiments, the word may be linked to a particular recipient, so that in future uses of the word it will still be treated as a profanity in most scenarios, but be treated as a regular word when used with “familiar” or “informal” contacts. The process then concludes by progressing to step 1040 ofFIG. 10 . - Else, if at
step 1313 theUser 101 does not explicitly select the profanity from the candidate word listing, the modified profanity may be removed from the candidate word listing. The process then concludes by progressing to step 1040 ofFIG. 10 . -
FIG. 14 shows a flow chart illustrating a process of statistical analysis of words, shown generally at 1113. The process begins fromstep FIG. 11 . The process then progresses to step 1401 where thePhrasing Analyzer 511 analyzes the message for word groups. - The process then progresses to step 1403 where word use likelihood, including frequency and recency, may be indexed by the
Word Frequency Tracker 515. By indexing word use, the AutomatedDictionary Population System 100 eliminates the need to repetitively store multiple copies of a particular word in theDictionary Set 115. Also, these indices may be of particular use in the generation of predictive candidate word lists. Frequency tracking may be a simple count of word use, or may, in some embodiments, involve more sophisticated tracking of word use by sentence structure, message content, proximate words, or intended recipient. - One such index is illustrated at
step 1405, where word use likelihood is indexed by recipient. The verbiage utilized when speaking to one's lover, mother, friend or business associate may vary greatly. By linking word use frequency by recipient, predictive candidate lists may be more finely tuned when writing a message to a known recipient. - The process then progresses to step 1407 where language is analyzed for affect; that is, the emotional effect invoked by the message. Particular words or phrases may be identified which denotes particular emotional response. Likewise, particular grammar may also denote mood of the message. For example, speech patterns directed to a teen friend, versus a parent or employer may be identified. Monitoring affection in the language may be particularly useful in generation of candidate word lists. The process then concludes by progressing to step 1040 of
FIG. 10 . -
FIG. 15 shows a flow chart illustrating a process of analysis for word groups, shown generally at 1401. The process begins fromstep FIG. 11 . The process then progresses to step 1501 where thePhrase Group Identifier 611 identifies phrase groups. As noted earlier, when a group of adjacent words is parsed from the text message, if none are to be found in the main dictionary, they may be stored with additional information that allows them to remain linked. Two or more adjacent words may form a phrase or term of art or an associated word group. Words which precede or follow the group will generally be found in the main dictionary. Phrases may also be identified by explicit mean such as capitalization, quotation marks surrounding the phrase, by underlining or marking in a distinct way. Likewise, certain words are considered to be “joining” words. It is not unusual to have related words located either side of the joining words. Examples would be “of” and “in”. Additionally, acronyms which stand for a particular phrase (such as FAQS) may likewise be identified. - The process then progresses to step 1503 where the
Linker 613 links the words identified as a phrase. By providing linking indicators, the AutomatedDictionary Population System 100 minimizes the need to store each phrase separately. Instead, where the phrase includes words found in theDictionary Set 115, each of the already stored words may further include links to generate the phrase. This enables conservation of storage resources. The process then concludes by progressing to step 1403 ofFIG. 14 . -
FIG. 16 shows a flow chart illustrating a process of identifying phrase groups, shown generally at 1501. The process begins fromstep FIG. 11 . The process then progresses to step 1601 where adjacent words not found in theDictionary Set 115 are identified as a potential phrase group. If additional indications of a phrase are present, such as capitalization, quotes or joining words, then the system may automatically save the word string as a phrase. If there are no other indications that the word string is a phrase, the system may query theUser 101 to resolve the ambiguity. - The process then progresses to step 1603 where capitalized phrases are identified. Likewise, at
step 1605 quoted phrases are identified; and atstep 1607 italicized phrases are identified. The process then progresses to step 1609 where semantic rules may be utilized to determine phrases. Such semantic analysis may include identifying “joining words” and particular rhyme or cadence associated with phrases. Often theUser 101 may be queried to resolve ambiguities on whether a particular set of words includes a phrase. - The process then progresses to step 1611 where common abbreviations and acronyms which designate phrases are identified. As noted, these are common in business settings; however, such “shorthand” is likewise becoming increasingly common during casual messaging with terms such as “lol”, “bff” and “cul8tr”. The process then concludes by progressing to step 1403 of
FIG. 14 . - In sum the present invention relates generally to automated dictionary generation system and method to provide fast, accurate and resource efficient population of personalized dictionaries. Additionally, this rapid dictionary population enhances early use of a mobile device, provide comprehensive profanity protection and aids in rapid text input on a mobile device. In this way the automated dictionary generation system and method may provide an invaluable tool for device manufacturers and device users.
- While this invention has been described in terms of several preferred embodiments, there are alterations, modifications, permutations, and substitute equivalents, which fall within the scope of this invention. For example, the present invention may be embodied as all software, all hardware, or some combination thereof. Although sub-section titles have been provided to aid in the description of the invention, these titles are merely illustrative and are not intended to limit the scope of the present invention.
- It should also be noted that there are many alternative ways of implementing the methods and apparatuses of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, modifications, permutations, and substitute equivalents as fall within the true spirit and scope of the present invention.
Claims (18)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/300,174 US9396178B2 (en) | 2008-06-06 | 2014-06-09 | Systems and methods for an automated personalized dictionary generator for portable devices |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/135,142 US8180630B2 (en) | 2008-06-06 | 2008-06-06 | Systems and methods for an automated personalized dictionary generator for portable devices |
US13/434,730 US8386241B2 (en) | 2008-06-06 | 2012-03-29 | Systems and methods for an automated personalized dictionary generator for portable devices |
US13/772,139 US8781816B2 (en) | 2008-06-06 | 2013-02-20 | Systems and methods for an automated personalized dictionary generator for portable devices |
US14/300,174 US9396178B2 (en) | 2008-06-06 | 2014-06-09 | Systems and methods for an automated personalized dictionary generator for portable devices |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/772,139 Continuation US8781816B2 (en) | 2008-06-06 | 2013-02-20 | Systems and methods for an automated personalized dictionary generator for portable devices |
Publications (2)
Publication Number | Publication Date |
---|---|
US20140288924A1 true US20140288924A1 (en) | 2014-09-25 |
US9396178B2 US9396178B2 (en) | 2016-07-19 |
Family
ID=41398583
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/135,142 Expired - Fee Related US8180630B2 (en) | 2008-06-06 | 2008-06-06 | Systems and methods for an automated personalized dictionary generator for portable devices |
US13/434,730 Active US8386241B2 (en) | 2008-06-06 | 2012-03-29 | Systems and methods for an automated personalized dictionary generator for portable devices |
US13/772,139 Active US8781816B2 (en) | 2008-06-06 | 2013-02-20 | Systems and methods for an automated personalized dictionary generator for portable devices |
US14/300,174 Active US9396178B2 (en) | 2008-06-06 | 2014-06-09 | Systems and methods for an automated personalized dictionary generator for portable devices |
Family Applications Before (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/135,142 Expired - Fee Related US8180630B2 (en) | 2008-06-06 | 2008-06-06 | Systems and methods for an automated personalized dictionary generator for portable devices |
US13/434,730 Active US8386241B2 (en) | 2008-06-06 | 2012-03-29 | Systems and methods for an automated personalized dictionary generator for portable devices |
US13/772,139 Active US8781816B2 (en) | 2008-06-06 | 2013-02-20 | Systems and methods for an automated personalized dictionary generator for portable devices |
Country Status (3)
Country | Link |
---|---|
US (4) | US8180630B2 (en) |
EP (1) | EP2286350B1 (en) |
WO (1) | WO2009149453A1 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160170958A1 (en) * | 2013-10-17 | 2016-06-16 | International Business Machines Corporation | Messaging auto-correction using recipient feedback |
CN105956158A (en) * | 2016-05-17 | 2016-09-21 | 清华大学 | Automatic extraction method of network neologism on the basis of mass microblog texts and use information |
US9602449B2 (en) | 2013-10-17 | 2017-03-21 | International Business Machines Corporation | Correction of incoming messaging |
WO2021067835A1 (en) * | 2019-10-05 | 2021-04-08 | Liveramp, Inc. | System and method for email address selection |
US11379669B2 (en) * | 2019-07-29 | 2022-07-05 | International Business Machines Corporation | Identifying ambiguity in semantic resources |
Families Citing this family (220)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
US8117540B2 (en) * | 2005-05-18 | 2012-02-14 | Neuer Wall Treuhand Gmbh | Method and device incorporating improved text input mechanism |
US9606634B2 (en) | 2005-05-18 | 2017-03-28 | Nokia Technologies Oy | Device incorporating improved text input mechanism |
US8374846B2 (en) | 2005-05-18 | 2013-02-12 | Neuer Wall Treuhand Gmbh | Text input device and method |
US8036878B2 (en) * | 2005-05-18 | 2011-10-11 | Never Wall Treuhand GmbH | Device incorporating improved text input mechanism |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
US10002189B2 (en) | 2007-12-20 | 2018-06-19 | Apple Inc. | Method and apparatus for searching using an active ontology |
US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US8180630B2 (en) * | 2008-06-06 | 2012-05-15 | Zi Corporation Of Canada, Inc. | Systems and methods for an automated personalized dictionary generator for portable devices |
EP2133772B1 (en) | 2008-06-11 | 2011-03-09 | ExB Asset Management GmbH | Device and method incorporating an improved text input mechanism |
US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US8190601B2 (en) * | 2009-05-22 | 2012-05-29 | Microsoft Corporation | Identifying task groups for organizing search results |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US10255566B2 (en) | 2011-06-03 | 2019-04-09 | Apple Inc. | Generating and processing task items that represent tasks to perform |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US20110035211A1 (en) * | 2009-08-07 | 2011-02-10 | Tal Eden | Systems, methods and apparatus for relative frequency based phrase mining |
US8489390B2 (en) * | 2009-09-30 | 2013-07-16 | Cisco Technology, Inc. | System and method for generating vocabulary from network data |
US9201965B1 (en) * | 2009-09-30 | 2015-12-01 | Cisco Technology, Inc. | System and method for providing speech recognition using personal vocabulary in a network environment |
US8990083B1 (en) | 2009-09-30 | 2015-03-24 | Cisco Technology, Inc. | System and method for generating personal vocabulary from network data |
US8554854B2 (en) * | 2009-12-11 | 2013-10-08 | Citizennet Inc. | Systems and methods for identifying terms relevant to web pages using social network messages |
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant |
US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries |
US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US8510098B2 (en) * | 2010-01-29 | 2013-08-13 | Ipar, Llc | Systems and methods for word offensiveness processing using aggregated offensive word filters |
US8296130B2 (en) | 2010-01-29 | 2012-10-23 | Ipar, Llc | Systems and methods for word offensiveness detection and processing using weighted dictionaries and normalization |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US8935274B1 (en) | 2010-05-12 | 2015-01-13 | Cisco Technology, Inc | System and method for deriving user expertise based on data propagating in a network environment |
US8738377B2 (en) | 2010-06-07 | 2014-05-27 | Google Inc. | Predicting and learning carrier phrases for speech input |
US9213986B1 (en) * | 2010-06-29 | 2015-12-15 | Brian K. Buchheit | Modified media conforming to user-established levels of media censorship |
CN102467548B (en) * | 2010-11-15 | 2015-09-16 | 腾讯科技(深圳)有限公司 | A kind of recognition methods of neologisms and system |
US8903719B1 (en) * | 2010-11-17 | 2014-12-02 | Sprint Communications Company L.P. | Providing context-sensitive writing assistance |
US8667169B2 (en) | 2010-12-17 | 2014-03-04 | Cisco Technology, Inc. | System and method for providing argument maps based on activity in a network environment |
US9465795B2 (en) | 2010-12-17 | 2016-10-11 | Cisco Technology, Inc. | System and method for providing feeds based on activity in a network environment |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US8553065B2 (en) | 2011-04-18 | 2013-10-08 | Cisco Technology, Inc. | System and method for providing augmented data in a network environment |
US8528018B2 (en) | 2011-04-29 | 2013-09-03 | Cisco Technology, Inc. | System and method for evaluating visual worthiness of video data in a network environment |
US8620136B1 (en) | 2011-04-30 | 2013-12-31 | Cisco Technology, Inc. | System and method for media intelligent recording in a network environment |
US8909624B2 (en) | 2011-05-31 | 2014-12-09 | Cisco Technology, Inc. | System and method for evaluating results of a search query in a network environment |
US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
US8886797B2 (en) | 2011-07-14 | 2014-11-11 | Cisco Technology, Inc. | System and method for deriving user expertise based on data propagating in a network environment |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
US9348808B2 (en) * | 2011-12-12 | 2016-05-24 | Empire Technology Development Llc | Content-based automatic input protocol selection |
US8831403B2 (en) | 2012-02-01 | 2014-09-09 | Cisco Technology, Inc. | System and method for creating customized on-demand video reports in a network environment |
US9330083B2 (en) * | 2012-02-14 | 2016-05-03 | Facebook, Inc. | Creating customized user dictionary |
US9330082B2 (en) * | 2012-02-14 | 2016-05-03 | Facebook, Inc. | User experience with customized user dictionary |
US9235565B2 (en) * | 2012-02-14 | 2016-01-12 | Facebook, Inc. | Blending customized user dictionaries |
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
US9483461B2 (en) | 2012-03-06 | 2016-11-01 | Apple Inc. | Handling speech synthesis of content for multiple languages |
US8756052B2 (en) | 2012-04-30 | 2014-06-17 | Blackberry Limited | Methods and systems for a locally and temporally adaptive text prediction |
US9275636B2 (en) * | 2012-05-03 | 2016-03-01 | International Business Machines Corporation | Automatic accuracy estimation for audio transcriptions |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US10417037B2 (en) | 2012-05-15 | 2019-09-17 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US9495129B2 (en) | 2012-06-29 | 2016-11-15 | Apple Inc. | Device, method, and user interface for voice-activated navigation and browsing of a document |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
US8965754B2 (en) | 2012-11-20 | 2015-02-24 | International Business Machines Corporation | Text prediction using environment hints |
US9355099B2 (en) * | 2012-12-01 | 2016-05-31 | Althea Systems and Software Private Limited | System and method for detecting explicit multimedia content |
US9244905B2 (en) | 2012-12-06 | 2016-01-26 | Microsoft Technology Licensing, Llc | Communication context based predictive-text suggestion |
DE112014000709B4 (en) | 2013-02-07 | 2021-12-30 | Apple Inc. | METHOD AND DEVICE FOR OPERATING A VOICE TRIGGER FOR A DIGITAL ASSISTANT |
CN107330124A (en) * | 2013-03-11 | 2017-11-07 | 曹华诚 | Content recommendation method |
US9977779B2 (en) | 2013-03-14 | 2018-05-22 | Apple Inc. | Automatic supplementation of word correction dictionaries |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
EP3937002A1 (en) | 2013-06-09 | 2022-01-12 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
US10853572B2 (en) * | 2013-07-30 | 2020-12-01 | Oracle International Corporation | System and method for detecting the occureances of irrelevant and/or low-score strings in community based or user generated content |
US9465876B2 (en) | 2013-09-09 | 2016-10-11 | International Business Machines Corporation | Managing content available for content prediction |
US10296160B2 (en) | 2013-12-06 | 2019-05-21 | Apple Inc. | Method for extracting salient dialog usage from live data |
US9275037B2 (en) | 2014-02-18 | 2016-03-01 | International Business Machines Corporation | Managing comments relating to work items |
US9251141B1 (en) | 2014-05-12 | 2016-02-02 | Google Inc. | Entity identification model training |
US9607032B2 (en) | 2014-05-12 | 2017-03-28 | Google Inc. | Updating text within a document |
US9881010B1 (en) | 2014-05-12 | 2018-01-30 | Google Inc. | Suggestions based on document topics |
US9959296B1 (en) | 2014-05-12 | 2018-05-01 | Google Llc | Providing suggestions within a document |
CN105095182B (en) * | 2014-05-22 | 2018-11-06 | 华为技术有限公司 | A kind of return information recommendation method and device |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
US9785630B2 (en) | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models |
US10565219B2 (en) | 2014-05-30 | 2020-02-18 | Apple Inc. | Techniques for automatically generating a suggested contact based on a received message |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US10579212B2 (en) | 2014-05-30 | 2020-03-03 | Apple Inc. | Structured suggestions |
TWI566107B (en) | 2014-05-30 | 2017-01-11 | 蘋果公司 | Method for processing a multi-part voice command, non-transitory computer readable storage medium and electronic device |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US10078631B2 (en) | 2014-05-30 | 2018-09-18 | Apple Inc. | Entropy-guided text prediction using combined word and character n-gram language models |
US9760559B2 (en) | 2014-05-30 | 2017-09-12 | Apple Inc. | Predictive text input |
US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates |
US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback |
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection |
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
US10152299B2 (en) | 2015-03-06 | 2018-12-11 | Apple Inc. | Reducing response latency of intelligent automated assistants |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
US10628006B2 (en) | 2015-05-11 | 2020-04-21 | Samsung Electronics Co., Ltd. | Electronic device and method for managing applications on an electronic device |
US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US11025565B2 (en) * | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US20160378747A1 (en) | 2015-06-29 | 2016-12-29 | Apple Inc. | Virtual assistant for media playback |
US10042841B2 (en) * | 2015-07-17 | 2018-08-07 | International Business Machines Corporation | User based text prediction |
US10003938B2 (en) | 2015-08-14 | 2018-06-19 | Apple Inc. | Easy location sharing |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10445425B2 (en) | 2015-09-15 | 2019-10-15 | Apple Inc. | Emoji and canned responses |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US20170337923A1 (en) * | 2016-05-19 | 2017-11-23 | Julia Komissarchik | System and methods for creating robust voice-based user interface |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US10409903B2 (en) * | 2016-05-31 | 2019-09-10 | Microsoft Technology Licensing, Llc | Unknown word predictor and content-integrated translator |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US11227589B2 (en) | 2016-06-06 | 2022-01-18 | Apple Inc. | Intelligent list reading |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179588B1 (en) | 2016-06-09 | 2019-02-22 | Apple Inc. | Intelligent automated assistant in a home environment |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
US10474753B2 (en) | 2016-09-07 | 2019-11-12 | Apple Inc. | Language identification using recurrent neural networks |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
CN108614810A (en) * | 2016-12-09 | 2018-10-02 | 中国移动通信集团山西有限公司 | Complain hot spot automatic identifying method and device |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US11204787B2 (en) | 2017-01-09 | 2021-12-21 | Apple Inc. | Application integration with a digital assistant |
DK201770383A1 (en) | 2017-05-09 | 2018-12-14 | Apple Inc. | User interface for correcting recognition errors |
US10417266B2 (en) | 2017-05-09 | 2019-09-17 | Apple Inc. | Context-aware ranking of intelligent response suggestions |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
US10726832B2 (en) | 2017-05-11 | 2020-07-28 | Apple Inc. | Maintaining privacy of personal information |
US10395654B2 (en) | 2017-05-11 | 2019-08-27 | Apple Inc. | Text normalization based on a data-driven learning network |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK201770428A1 (en) | 2017-05-12 | 2019-02-18 | Apple Inc. | Low-latency intelligent automated assistant |
US11301477B2 (en) | 2017-05-12 | 2022-04-12 | Apple Inc. | Feedback analysis of a digital assistant |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
US20180336275A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Intelligent automated assistant for media exploration |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | Far-field extension for digital assistant services |
US10403278B2 (en) | 2017-05-16 | 2019-09-03 | Apple Inc. | Methods and systems for phonetic matching in digital assistant services |
US10311144B2 (en) | 2017-05-16 | 2019-06-04 | Apple Inc. | Emoji word sense disambiguation |
US10657328B2 (en) | 2017-06-02 | 2020-05-19 | Apple Inc. | Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling |
JP7013172B2 (en) * | 2017-08-29 | 2022-01-31 | 株式会社東芝 | Speech synthesis dictionary distribution device, speech synthesis distribution system and program |
US10445429B2 (en) | 2017-09-21 | 2019-10-15 | Apple Inc. | Natural language understanding using vocabularies with compressed serialized tries |
US10755051B2 (en) | 2017-09-29 | 2020-08-25 | Apple Inc. | Rule-based natural language processing |
US10636424B2 (en) | 2017-11-30 | 2020-04-28 | Apple Inc. | Multi-turn canned dialog |
US10733982B2 (en) | 2018-01-08 | 2020-08-04 | Apple Inc. | Multi-directional dialog |
US10733375B2 (en) | 2018-01-31 | 2020-08-04 | Apple Inc. | Knowledge-based framework for improving natural language understanding |
US10789959B2 (en) | 2018-03-02 | 2020-09-29 | Apple Inc. | Training speaker recognition models for digital assistants |
US10592604B2 (en) | 2018-03-12 | 2020-03-17 | Apple Inc. | Inverse text normalization for automatic speech recognition |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
US10909331B2 (en) | 2018-03-30 | 2021-02-02 | Apple Inc. | Implicit identification of translation payload with neural machine translation |
US11145294B2 (en) | 2018-05-07 | 2021-10-12 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US10928918B2 (en) | 2018-05-07 | 2021-02-23 | Apple Inc. | Raise to speak |
DK180171B1 (en) | 2018-05-07 | 2020-07-14 | Apple Inc | USER INTERFACES FOR SHARING CONTEXTUALLY RELEVANT MEDIA CONTENT |
US10984780B2 (en) | 2018-05-21 | 2021-04-20 | Apple Inc. | Global semantic word embeddings using bi-directional recurrent neural networks |
US11386266B2 (en) | 2018-06-01 | 2022-07-12 | Apple Inc. | Text correction |
DK179822B1 (en) | 2018-06-01 | 2019-07-12 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
DK201870355A1 (en) | 2018-06-01 | 2019-12-16 | Apple Inc. | Virtual assistant operation in multi-device environments |
US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
US11076039B2 (en) | 2018-06-03 | 2021-07-27 | Apple Inc. | Accelerated task performance |
US11010561B2 (en) | 2018-09-27 | 2021-05-18 | Apple Inc. | Sentiment prediction from textual data |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
US10839159B2 (en) | 2018-09-28 | 2020-11-17 | Apple Inc. | Named entity normalization in a spoken dialog system |
US11170166B2 (en) | 2018-09-28 | 2021-11-09 | Apple Inc. | Neural typographical error modeling via generative adversarial networks |
US10861439B2 (en) * | 2018-10-22 | 2020-12-08 | Ca, Inc. | Machine learning model for identifying offensive, computer-generated natural-language text or speech |
US20200125639A1 (en) * | 2018-10-22 | 2020-04-23 | Ca, Inc. | Generating training data from a machine learning model to identify offensive language |
US11475898B2 (en) | 2018-10-26 | 2022-10-18 | Apple Inc. | Low-latency multi-speaker speech recognition |
US11638059B2 (en) | 2019-01-04 | 2023-04-25 | Apple Inc. | Content playback on multiple devices |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
US11475884B2 (en) | 2019-05-06 | 2022-10-18 | Apple Inc. | Reducing digital assistant latency when a language is incorrectly determined |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
US11423908B2 (en) | 2019-05-06 | 2022-08-23 | Apple Inc. | Interpreting spoken requests |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
US11496600B2 (en) | 2019-05-31 | 2022-11-08 | Apple Inc. | Remote execution of machine-learned models |
DK180129B1 (en) | 2019-05-31 | 2020-06-02 | Apple Inc. | User activity shortcut suggestions |
US11289073B2 (en) | 2019-05-31 | 2022-03-29 | Apple Inc. | Device text to speech |
US11074408B2 (en) | 2019-06-01 | 2021-07-27 | Apple Inc. | Mail application features |
US11360641B2 (en) | 2019-06-01 | 2022-06-14 | Apple Inc. | Increasing the relevance of new available information |
US11194467B2 (en) | 2019-06-01 | 2021-12-07 | Apple Inc. | Keyboard management user interfaces |
US11871308B2 (en) * | 2019-07-29 | 2024-01-09 | TapText llc | System and method for link-initiated dynamic-mode communications |
US11488406B2 (en) | 2019-09-25 | 2022-11-01 | Apple Inc. | Text detection using global geometry estimators |
US11645461B2 (en) | 2020-02-10 | 2023-05-09 | International Business Machines Corporation | User-centric optimization for interactive dictionary expansion |
DE102020109357A1 (en) | 2020-04-03 | 2021-10-07 | Krohne Messtechnik Gmbh | Method for evaluating the installation position of a measuring device in a system, augmented reality device and method for installing a measuring device |
US20240143293A1 (en) * | 2022-10-27 | 2024-05-02 | Vmware, Inc. | Reusing and recommending user interface (ui) contents based on semantic information |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7149970B1 (en) * | 2000-06-23 | 2006-12-12 | Microsoft Corporation | Method and system for filtering and selecting from a candidate list generated by a stochastic input method |
US8180630B2 (en) * | 2008-06-06 | 2012-05-15 | Zi Corporation Of Canada, Inc. | Systems and methods for an automated personalized dictionary generator for portable devices |
Family Cites Families (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0756933A (en) * | 1993-06-24 | 1995-03-03 | Xerox Corp | Method for retrieval of document |
US5799268A (en) * | 1994-09-28 | 1998-08-25 | Apple Computer, Inc. | Method for extracting knowledge from online documentation and creating a glossary, index, help database or the like |
US6061675A (en) * | 1995-05-31 | 2000-05-09 | Oracle Corporation | Methods and apparatus for classifying terminology utilizing a knowledge catalog |
DE69607472T2 (en) * | 1995-07-26 | 2000-08-24 | Tegic Communications Inc | SYSTEM FOR SUPPRESSING AMBIANCE IN A REDUCED KEYBOARD |
US6782510B1 (en) * | 1998-01-27 | 2004-08-24 | John N. Gross | Word checking tool for controlling the language content in documents using dictionaries with modifyable status fields |
US6115709A (en) * | 1998-09-18 | 2000-09-05 | Tacit Knowledge Systems, Inc. | Method and system for constructing a knowledge profile of a user having unrestricted and restricted access portions according to respective levels of confidence of content of the portions |
JP3717730B2 (en) * | 1999-11-02 | 2005-11-16 | セイコーインスツル株式会社 | Electronic dictionary |
US20020082868A1 (en) * | 2000-12-27 | 2002-06-27 | Pories Walter J. | Systems, methods and computer program products for creating and maintaining electronic medical records |
US7580831B2 (en) * | 2002-03-05 | 2009-08-25 | Siemens Medical Solutions Health Services Corporation | Dynamic dictionary and term repository system |
GB2396940A (en) * | 2002-12-31 | 2004-07-07 | Nokia Corp | A predictive text editor utilising words from received text messages |
US20060259543A1 (en) * | 2003-10-06 | 2006-11-16 | Tindall Paul G | Method and filtering text messages in a communication device |
US7490033B2 (en) * | 2005-01-13 | 2009-02-10 | International Business Machines Corporation | System for compiling word usage frequencies |
EP1717668A1 (en) * | 2005-04-29 | 2006-11-02 | Research In Motion Limited | Method for generating text that meets specified characteristics in a handheld electronic device and a handheld electronic device incorporating the same |
US8117540B2 (en) * | 2005-05-18 | 2012-02-14 | Neuer Wall Treuhand Gmbh | Method and device incorporating improved text input mechanism |
US20070100653A1 (en) | 2005-11-01 | 2007-05-03 | Jorey Ramer | Mobile website analyzer |
US20070100806A1 (en) | 2005-11-01 | 2007-05-03 | Jorey Ramer | Client libraries for mobile content |
US20070168354A1 (en) | 2005-11-01 | 2007-07-19 | Jorey Ramer | Combined algorithmic and editorial-reviewed mobile content search results |
US20070100650A1 (en) | 2005-09-14 | 2007-05-03 | Jorey Ramer | Action functionality for mobile content search results |
US20080076472A1 (en) * | 2006-09-22 | 2008-03-27 | Sony Ericsson Mobile Communications Ab | Intelligent Predictive Text Entry |
US7912700B2 (en) * | 2007-02-08 | 2011-03-22 | Microsoft Corporation | Context based word prediction |
-
2008
- 2008-06-06 US US12/135,142 patent/US8180630B2/en not_active Expired - Fee Related
-
2009
- 2009-06-08 WO PCT/US2009/046620 patent/WO2009149453A1/en active Application Filing
- 2009-06-08 EP EP09759611.8A patent/EP2286350B1/en not_active Not-in-force
-
2012
- 2012-03-29 US US13/434,730 patent/US8386241B2/en active Active
-
2013
- 2013-02-20 US US13/772,139 patent/US8781816B2/en active Active
-
2014
- 2014-06-09 US US14/300,174 patent/US9396178B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7149970B1 (en) * | 2000-06-23 | 2006-12-12 | Microsoft Corporation | Method and system for filtering and selecting from a candidate list generated by a stochastic input method |
US8180630B2 (en) * | 2008-06-06 | 2012-05-15 | Zi Corporation Of Canada, Inc. | Systems and methods for an automated personalized dictionary generator for portable devices |
US8386241B2 (en) * | 2008-06-06 | 2013-02-26 | Zi Corporation Of Canada, Inc. | Systems and methods for an automated personalized dictionary generator for portable devices |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160170958A1 (en) * | 2013-10-17 | 2016-06-16 | International Business Machines Corporation | Messaging auto-correction using recipient feedback |
US9602449B2 (en) | 2013-10-17 | 2017-03-21 | International Business Machines Corporation | Correction of incoming messaging |
CN105956158A (en) * | 2016-05-17 | 2016-09-21 | 清华大学 | Automatic extraction method of network neologism on the basis of mass microblog texts and use information |
US11379669B2 (en) * | 2019-07-29 | 2022-07-05 | International Business Machines Corporation | Identifying ambiguity in semantic resources |
WO2021067835A1 (en) * | 2019-10-05 | 2021-04-08 | Liveramp, Inc. | System and method for email address selection |
US20240070157A1 (en) * | 2019-10-05 | 2024-02-29 | Liveramp, Inc. | System and Method for Email Address Selection |
Also Published As
Publication number | Publication date |
---|---|
US20130197901A1 (en) | 2013-08-01 |
US20090306969A1 (en) | 2009-12-10 |
EP2286350A1 (en) | 2011-02-23 |
EP2286350A4 (en) | 2012-08-29 |
US20120185239A1 (en) | 2012-07-19 |
US8781816B2 (en) | 2014-07-15 |
EP2286350B1 (en) | 2018-05-23 |
US8180630B2 (en) | 2012-05-15 |
WO2009149453A1 (en) | 2009-12-10 |
WO2009149453A8 (en) | 2010-07-29 |
US9396178B2 (en) | 2016-07-19 |
US8386241B2 (en) | 2013-02-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9396178B2 (en) | Systems and methods for an automated personalized dictionary generator for portable devices | |
US9128922B2 (en) | Handheld electronic device and method for performing optimized spell checking during text entry by providing a sequentially ordered series of spell-check algorithms | |
US9195645B2 (en) | Generating string predictions using contexts | |
US8547329B2 (en) | Handheld electronic device and method for performing spell checking during text entry and for integrating the output from such spell checking into the output from disambiguation | |
US9058320B2 (en) | Handheld electronic device and method for performing spell checking during text entry and for providing a spell-check learning feature | |
US20040153975A1 (en) | Text entry mechanism for small keypads | |
JP2013117978A (en) | Generating method for typing candidate for improvement in typing efficiency | |
KR20100046043A (en) | Disambiguation of keypad text entry | |
US20060033644A1 (en) | System and method for filtering far east languages | |
EP2202612B1 (en) | Automatic language selection for improving text accuracy | |
CN102999639A (en) | Speech recognition character index based method and system for searching | |
CA2583923C (en) | Handheld electronic device and method for performing spell checking during text entry and for providing a spell-check learning feature | |
JP6221275B2 (en) | Character input program and character input device | |
CA2584444C (en) | Handheld electronic device and method for performing optimized spell checking during text entry by providing a sequentially ordered series of spell-check algorithms | |
CA2584033C (en) | Handheld electronic device and method for performing spell checking during text entry and for integrating the output from such spell checking into the output from disambiguation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: ZI CORPORATION OF CANADA, INC., CANADA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:GOUD, CORNEIL JOHN;WILLIAMS, ROLAND EMLYN;TEMPLETON-STEADMAN, WILLIAM JAMES;SIGNING DATES FROM 20080707 TO 20080806;REEL/FRAME:037688/0506 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: CERENCE INC., MASSACHUSETTS Free format text: INTELLECTUAL PROPERTY AGREEMENT;ASSIGNOR:NUANCE COMMUNICATIONS, INC.;REEL/FRAME:050836/0191 Effective date: 20190930 |
|
AS | Assignment |
Owner name: CERENCE OPERATING COMPANY, MASSACHUSETTS Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE ASSIGNEE NAME PREVIOUSLY RECORDED AT REEL: 050836 FRAME: 0191. ASSIGNOR(S) HEREBY CONFIRMS THE INTELLECTUAL PROPERTY AGREEMENT;ASSIGNOR:NUANCE COMMUNICATIONS, INC.;REEL/FRAME:050871/0001 Effective date: 20190930 |
|
AS | Assignment |
Owner name: BARCLAYS BANK PLC, NEW YORK Free format text: SECURITY AGREEMENT;ASSIGNOR:CERENCE OPERATING COMPANY;REEL/FRAME:050953/0133 Effective date: 20191001 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |
|
AS | Assignment |
Owner name: CERENCE OPERATING COMPANY, MASSACHUSETTS Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:BARCLAYS BANK PLC;REEL/FRAME:052927/0335 Effective date: 20200612 |
|
AS | Assignment |
Owner name: WELLS FARGO BANK, N.A., NORTH CAROLINA Free format text: SECURITY AGREEMENT;ASSIGNOR:CERENCE OPERATING COMPANY;REEL/FRAME:052935/0584 Effective date: 20200612 |
|
AS | Assignment |
Owner name: CERENCE OPERATING COMPANY, MASSACHUSETTS Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE REPLACE THE CONVEYANCE DOCUMENT WITH THE NEW ASSIGNMENT PREVIOUSLY RECORDED AT REEL: 050836 FRAME: 0191. ASSIGNOR(S) HEREBY CONFIRMS THE ASSIGNMENT;ASSIGNOR:NUANCE COMMUNICATIONS, INC.;REEL/FRAME:059804/0186 Effective date: 20190930 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 8 |