JP2004240859A

JP2004240859A - Paraphrasing system

Info

Publication number: JP2004240859A
Application number: JP2003031181A
Authority: JP
Inventors: Sayori Shimohata; さより下畑
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2003-02-07
Filing date: 2003-02-07
Publication date: 2004-08-26

Abstract

<P>PROBLEM TO BE SOLVED: To realize a system which intelligibly paraphrases a phrase likely to require specialized knowledge. <P>SOLUTION: A dictionary database 31 is provided which associates a headword with a paraphrased word representing the headword by another expression, and indicates the words. Furthermore, a user information storage table 32 is provided which indicates whether or not a headword is to be paraphrased to a paraphrased word for each user. If text is inputted together with user information from an input part 11, a conversion part 22 refers to the dictionary database 31, and searches a headword matching an arbitrary phrase in the text. Furthermore, if a matching headword is found, the conversion part 22 refers to the user information storage table 32 to examine whether the user requires the conversion of the headword, and if required, converts the arbitrary phrase into paraphrased words. <P>COPYRIGHT: (C)2004,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、テキストを、ユーザに合わせて適切な言い換え語に言い換える言い換えシステムに関し、特に、ユーザのレベル（習熟度、保有する知識、身体的制限など）に応じて、外来語や専門用語を分かり易く言い換える言い換えシステムに関するものである。
【０００２】
【従来の技術】
従来、ユーザの年齢や視力、習熟度に応じて表示画面を切り替える技術があった（例えば、特許文献１参照）。
この文献では、ユーザが年齢設定を行うことで、小学生であれば使用する漢字を制限する、高齢者であれば視力測定を行わせて使用する文字の大きさを変える、といった表示画面の変更を行う技術が開示されている。
【０００３】
【特許文献１】
特開２０００−３０５７４６号公報
【０００４】
【発明が解決しようとする課題】
しかしながら、上記従来の技術では、表示される内容はシステムが設定したレベルによって一律であり、ユーザの嗜好や習熟度に応じて細かくカスタマイズすることはできなかった。
【０００５】
また、近年、コンピュータ等の分野において、外来語（カタカナ語）や専門知識を必要とする単語が頻繁に使用されるようになり、特に高齢者等では、文章の理解に支障をきたすという問題がある。このような場合、分かりにくい表現を分かり易く言い換える仕組みが必要であるが、このような言い換えシステムは実現されていなかった。
【０００６】
【課題を解決するための手段】
本発明は、前述の課題を解決するため次の構成を採用する。
〈構成１〉
見出し語と、見出し語を別の表現で表す言い換え語とを対応付けて示す辞書データベースと、任意の語句に対して、辞書データベースを参照し、一致した見出し語があった場合は、語句を言い換え語に変換して出力する変換部とを備えたことを特徴とする言い換えシステム。
【０００７】
〈構成２〉
構成１に記載の言い換えシステムにおいて、見出し語を言い換え語に言い換えるか否かを各ユーザ毎に示すユーザ情報格納テーブルと、ユーザが指定された場合、ユーザに対応したユーザ情報に基づいて、任意の語句を言い換え語に変換する変換部を備えたことを特徴とする言い換えシステム。
【０００８】
〈構成３〉
構成２に記載の言い換えシステムにおいて、ユーザが見出し語を言い換えるか否かを指定した場合、指定内容を前記ユーザ情報格納テーブルに反映させる言い換え情報設定部を備えたことを特徴とする言い換えシステム。
【０００９】
〈構成４〉
構成２または３に記載の言い換えシステムにおいて、ユーザが与えた文書中の語句が辞書データベース中の見出し語に一致した場合は、ユーザ情報格納テーブルにおいて、見出し語を言い換え不要と設定する習熟度学習部とを備えたことを特徴とする言い換えシステム。
【００１０】
〈構成５〉
構成２または３に記載の言い換えシステムにおいて、ユーザが与えた文書中の語句が辞書データベース中の言い換え語に一致した場合は、ユーザ情報格納テーブルにおいて、見出し語を言い換え要と設定する習熟度学習部とを備えたことを特徴とする言い換えシステム。
【００１１】
〈構成６〉
構成１〜５のいずれかに記載の言い換えシステムにおいて、辞書データベースと変換部に対して、通信回線を介して接続する入出力装置を設け、入出力装置は、任意の語句を言い換え要求と共に変換部に対して送信し、かつ、変換部からの言い換え結果を受信するよう構成されていることを特徴とする言い換えシステム。
【００１２】
〈構成７〉
構成１〜６のいずれかに記載の言い換えシステムにおいて、見出し語の文脈上の関係に基づく条件に対応した言い換え語を備えた辞書データベースと、任意の語句に対し、語句の文脈上の関係を示す情報に基づいて、辞書データベースを参照し、情報が文脈上の関係に基づく条件に適合する見出し語が存在した場合は、見出し語に対応した言い換え語に変換する変換部を備えたことを特徴とする言い換えシステム。
【００１３】
〈構成８〉
構成１〜７のいずれかに記載の言い換えシステムにおいて、入力された文字列を形態素解析して単語を抽出し、抽出された単語に基づいて見出し語を言い換え語に変換する変換部を備えたことを特徴とする言い換えシステム。
【００１４】
〈構成９〉
構成１〜７のいずれかに記載の言い換えシステムにおいて、文書中の任意の文字から１文字ずつ文字列を増やし、文字列を見出し語と比較し、一致した場合に、文字列を言い換え対象となる語句であると判定する変換部を備えたことを特徴とする言い換えシステム。
【００１５】
〈構成１０〉
構成１〜９のいずれかに記載の言い換えシステムにおいて、言い換え語に変換した箇所は、変換しない部分とは異なる表示を行うよう構成されたことを特徴とする言い換えシステム。
【００１６】
【発明の実施の形態】
以下、本発明の実施の形態を具体例を用いて詳細に説明する。
《具体例１》
〈構成〉
図１は、本発明の言い換えシステムの具体例１を示す構成図である。
図示のシステムはコンピュータで構成されており、入出力装置１０、処理装置２０、記憶装置３０からなる。入出力装置１０は、言い換え対象となる文書の入力や言い換え結果の出力を行う機能部であり、入力部１１、出力部１２を備えている。入力部１１は、キーボードやマウス等のポインティングデバイスによるテキスト入力処理、スキャナと文字認識処理部によるテキスト入力処理、マイクと音声認識処理などによるテキスト入力処理といったテキスト文書の入力処理を行う機能を有している。また、入力部１１は、ユーザが個別に設定する言い換え情報を入力する機能を有している。出力部１２は、ディスプレイ装置への表示、音声への変換および音声出力、ファイルへの出力を行う機能を有しており、入力部１１より入力したテキストの処理結果を出力するものである。
【００１７】
処理装置２０は、演算装置やメモリ、制御部等から構成され、ユーザの習熟度レベルを設定したり、入力部１１から入力されたテキストを別の表現に言い換えたりする処理を実行する機能を有している。
処理装置２０は、言い換え情報設定部２１、変換部２２を備えている。言い換え情報設定部２１は、入力部１１から入力された各ユーザの習熟度を設定したり、特定の単語の言い換え情報を後述するユーザ情報格納テーブル３２に格納する機能部である。
【００１８】
変換部２２は、入力部１１から入力されたテキスト中の任意の語句に対して、記憶装置３０内の後述する辞書データベース３１とユーザ情報格納テーブル３２を参照し、一致した見出し語があった場合は、ユーザ情報格納テーブル３２のユーザ情報に基づいてその語句を言い換え語に変換する機能を有しており、形態素解析部２２１、辞書検索部２２２、テキスト変換部２２３からなる。
【００１９】
形態素解析部２２１は、テキストを単語単位に分割する処理を行うための既知の形態素解析を行う処理部である。辞書検索部２２２は、形態素解析部２２１で分割された単語に対して、辞書データベース３１を参照して、見出し語に関連付けられた情報を獲得する処理を行う機能部である。テキスト変換部２２３は、辞書検索部２２２の辞書検索結果に基づいて、テキストを言い換える処理を行う機能部である。
【００２０】
記憶装置３０は、ハードディスク装置や光ディスク装置あるいは半導体メモリといった記憶装置からなり、辞書データベース３１とユーザ情報格納テーブル３２が設けられている。辞書データベース３１は、言い換え表現のデータを格納するデータベースである。また、ユーザ情報格納テーブル３２は、ユーザの習熟度情報を格納するテーブルである。
【００２１】
図２は、辞書データベース３１とユーザ情報格納テーブル３２の説明図である。
図示のように、辞書データベース３１は、見出し語格納部３１１、言い換え語格納部３１２、条件格納部３１３からなり、これらのフィールドに対応したユーザ情報格納テーブル３２の情報によって１レコードが形成されている。見出し語格納部３１１は、言い換えを必要とする語句（見出し語）を格納する。また、言い換え語格納部３１２は、各見出し語を別の表現で言い換えた語句（言い換え語）を格納する。更に、条件格納部３１３は、言い換え処理を適用するための条件を設定する情報であり、本具体例では見出し語の難易度を表している。ここで、難易度とは、単語の専門性の高さを示している。難易度が高いほど、一般に理解しにくいということになり、言い換えが必要となる。本具体例において、習熟度と難易度のレベルは対応しているものとする。更に、ユーザ情報格納テーブル３２は各ユーザの識別情報（ＩＤ）毎に、各見出し語を言い換え処理するか否かの情報を示すものである。
【００２２】
〈動作〉
図３は、具体例１の動作を示すフローチャートである。
先ず、本装置を使用するユーザは、入力部１１からユーザＩＤ等を入力するといった方法によりユーザ認証処理を行う（ステップＳ１０１）。そして、言い換え情報の設定を行うか、言い換え処理を行うかの選択を行う（ステップＳ１０２）。ステップＳ１０２において、“Ｙ”の場合は変換部２２の動作となり、“Ｎ”の場合は言い換え情報設定部２１の動作となる。
【００２３】
１）言い換え情報の設定を行う場合（ステップＳ１０２で“Ｎ”の場合）
言い換え情報の設定には、装置の基準に従って全体的なレベルを指定する方法と、単語毎に言い換えの必要があるかどうかを設定する方法がある。
【００２４】
１−１）先ず、全体的なレベルを指定して言い換え情報を設定する方法について説明する。ここでは、レベル１を初心者、レベル２を中級者、レベル３を上級者と定義していると仮定する。
【００２５】
ユーザが入出力装置１０を通して自らのレベルを選択入力すると、言い換え情報設定部２１は、ユーザ情報格納テーブル３２にこの情報を設定する。例えば、ユーザがレベル２を選択すると（ステップＳ１０３）、難易度がレベル２以上の単語、即ちレベル２およびレベル３の単語が言い換えの対象となり、言い換え情報設定部２１は、これらのレベルの見出し語に対応したユーザ情報格納テーブル３２のフィールドに、言い換えを行うことを示す情報（図２では「○」）を付与する（ステップＳ１０４）。図２において、ＩＤ１のユーザ情報格納テーブル３２は、ＩＤ１のユーザが「中級者」＝レベル２を選択した状態を示している。
【００２６】
１−２）次に、単語毎に言い換え情報を設定する方法について説明する。
ユーザのレベルにかかわらず特定の単語の言い換えを行いたい、あるいは、行いたくない場合が存在する。このような場合は、入出力装置１０を通して、単語とその単語を言い換えるかどうかの情報を入力する（ステップＳ１０３）。これにより言い換え情報設定部２１は、指定された単語のユーザ情報格納テーブル３２に、その単語を言い換えるか否かの情報を付与する（ステップＳ１０４）。図２において、ＩＤ２のユーザ情報格納テーブル３２は、ＩＤ２のユーザが「中級者」＝レベル２を選択した上で、レベル１の単語である「チョリソ」を言い換え要に、レベル２の単語である「コンフィ」を言い換え不要に設定した状態を示している。
【００２７】
２）言い換え処理を行う場合（ステップＳ１０２で“Ｙ”の場合）
言い換え処理を行う場合は、ユーザが入力部１１より言い換えを行いたいテキストを入力する（ステップＳ１０５）。これにより、変換部２２においてそのテキストに対する言い換え処理が行われ（ステップＳ１０６）、処理結果が出力部１２より画面表示等の手段によって出力される（ステップＳ１０７）。その後、更に新しいテキストを処理する場合（ステップＳ１０８）は、ステップＳ１０５に戻り、そうでなければログアウトして（ステップＳ１０９）、処理を終了する。
【００２８】
次に、上述したステップＳ１０６の言い換え処理の動作について詳細に説明する。
図４は、言い換え処理の動作を示すフローチャートである。
先ず、形態素解析部２２１が、入力されたテキストを形態素解析する（ステップＳ１１１）。次に、辞書検索部２２２が、形態素解析部２２１による形態素解析結果から１語を取り出し（ステップＳ１１２）、その単語をキーに辞書データベース３１を検索する（ステップＳ１１３）。辞書データベース３１にその単語が登録されていなければ、その単語を、変換部２２内の図示しないバッファに格納する（ステップＳ１１４、Ｓ１１５）。また、その単語が見出し語として辞書データベース３１に登録されていた場合は、そのユーザのユーザ情報格納テーブル３２の情報に従って対応する言い換え語をバッファに格納する（ステップＳ１１６）。次に、ステップＳ１１２で取り出した語がテキストの最後の単語であるかを判定し（ステップＳ１１７）、最後の単語でない場合は、ステップＳ１１２に戻り、上述したステップＳ１１２〜ステップＳ１１５（またはステップＳ１１６）の処理を繰り返す。
【００２９】
図５は、文書の一例としてレストランのメニューを示す図である。
以下、このテキストを使って、ＩＤ２のユーザが言い換え処理を行った場合の処理の流れを具体的に説明する。
【００３０】
図６は、図５中のテキストの１文を形態素解析した結果の説明図である。
図７は、作業用バッファの内容の説明図である。
図８は、言い換え処理後のメニューの説明図である。
【００３１】
言い換え処理を行う場合、先ず、図６の形態素解析結果（ＴＸ６１）から「チョリソ」を読む。次に、図２に示すように、辞書データベース３１を検索すると、ＩＤ２の「チョリソ」のフィールドが「○」、即ち、言い換え要と設定されていることが分かる。そこで、「チョリソ」の言い換え語である「辛口ソーセージ」をバッファに格納する。即ち、図７のＴＸ７１「辛口ソーセージ」がバッファに格納される。
【００３２】
尚、ＩＤ２のユーザは習熟度のレベルが２なので、「チョリソ」は、ユーザの習熟度のレベルより低い難易度レベルの語であるが、ユーザ情報格納テーブル３２の情報が優先されて言い換えを行うことになる。
【００３３】
次に、図６に示す形態素解析結果から「入り」を読み、同様に辞書データベース３１を検索する。辞書データベース３１には「入り」は登録されていないので、「入り」をそのままバッファに登録する（図７のＴＸ７２の状態）。そして、同様の処理を「空豆」〜「コンフィ」に対しても行う。
【００３４】
図７におけるＴＸ７３は、図６のＴＸ６１の文に対して言い換え処理が終了した時点のバッファの内容を示している。即ち、ＴＸ６１における「ラグー」「ポワブロン」が言い換え処理され、「コンフィ」は、ＩＤ２のユーザは言い換え不要であるため、そのまま出力されている。このような言い換え処理を、図５に示したメニュー全ての文に対して行った結果が図８に示す状態である。このような入力テキストの全ての文に対して言い換え処理が終了すると、テキスト変換部２２３は、その処理結果を出力部１２に送り、出力部１２はその処理結果を画面などに表示する。
【００３５】
〈効果〉
以上のように、具体例１によれば、見出し語に対する言い換え語を示す辞書データベース３１を設け、この辞書データベース３１に基づいて、入力文中の語いを言い換え語に変換するようにしたので、例えば専門用語のように、分かりにくい表現を、ユーザの習熟度に応じて分かり易い表現に言い換えることができる。
【００３６】
また、ユーザが、個々の単語を言い換えるかどうかを個別に設定するようにしたので、ユーザの好みや習熟度に応じて、きめ細かいカスタマイズが可能となる。
【００３７】
《具体例２》
具体例２は、ユーザが習熟度を示す文書を入力し、これに基づいて、ユーザ情報格納テーブル中の言い換えの要否を設定するようにしたものである。
【００３８】
〈構成〉
図９は、具体例２の構成図である。
図示のシステムは、入出力装置１０、処理装置２０ａ、記憶装置３０からなる。ここで、入出力装置１０および記憶装置３０の構成は、具体例１と同様であるため、その説明は省略する。処理装置２０ａは、変換部２２と習熟度学習部２３からなる。即ち、具体例２では、具体例１における言い換え情報設定部２１に代わって習熟度学習部２３を設けたものである。ここで、変換部２２は具体例１と同様であるため、その説明は省略する。また、習熟度学習部２３は、ユーザが与えたテキスト中の語句が辞書データベース３１中の見出し語に一致した場合は、ユーザ情報格納テーブル３２において、言い換え不要と設定し、その語句が言い換え語に一致した場合は、言い換えが必要であると設定する機能を有している。
【００３９】
〈動作〉
図１０は、具体例２の動作を示すフローチャートである。
具体例２においても、ユーザは入力部１１からユーザＩＤを入力する等の方法で、ユーザ認証処理を行い（ステップＳ２０１）、習熟度の学習を行うか、言い換え処理を行うかを選択する（ステップＳ２０２）。ステップＳ２０２において、“Ｙ”の場合は変換部２２の動作となり、“Ｎ”の場合は習熟度学習部２３の動作となる。
【００４０】
１）習熟度の学習を行う場合（ステップＳ２０２で、“Ｎ”の場合）
ユーザが入力部１１からテキストを入力すると、習熟度学習部２３は、習熟度学習処理を行う（ステップＳ２０３）。ここで、ユーザが入力するテキストは、ユーザが作成したテキストでもよいし、インターネット上のＷｅｂページといったものであってもよい。即ち、ユーザが作成したテキストや、ユーザが読んで理解できたテキストに使われている用語は、ユーザが習得した単語であると考えることができる。このような観点から、対象となるユーザの習熟度学習処理を行う。
【００４１】
図１１は、習熟度学習処理の流れを示すフローチャートである。
習熟度学習処理を行う場合、先ず、入力されたテキストを形態素解析する（ステップＳ２２１）。次に、形態素解析結果の単語を１語ずつ辞書検索し、辞書データベース３１に登録されているかを調べる（ステップＳ２２２、Ｓ２２３）。その単語が辞書データベース３１の言い換え語格納部３１２に登録されている場合は、ユーザ情報格納テーブル３２の該当する欄に、言い換え要を示す情報「○」を格納し（ステップＳ２２４、Ｓ２２５）、次の単語の処理に移る（ステップＳ２２６）。一方、その単語が見出し語格納部３１１に登録されている場合は、ユーザ情報格納テーブル３２の該当する欄に、言い換え不要を示す情報「×」を格納し（ステップＳ２２４、Ｓ２２５）、次の単語の処理に移る（ステップＳ２２６）。辞書データベース３１の、見出し語格納部３１１、言い換え語格納部３１２のいずれにも登録されていない場合は、何も処理を行わずに次の単語の処理に移る（ステップＳ２２４、Ｓ２２６）。入力テキスト中の全ての単語に対してステップＳ２１１〜ステップＳ２２５の処理が終われば、習熟度学習処理を終了する。
【００４２】
尚、ユーザ情報格納テーブル３２の言い換え要／不要の情報が空欄のデータについては、予め定義された規則に従って（例えば、条件格納部３１３のレベルによって、「○」あるいは「×」を一括付与するなど）言い換え要／不要の情報を付与するようにしてもよい。
【００４３】
２）言い換え処理を行う場合（ステップＳ２０２で、“Ｙ”の場合）
具体例２において、ステップＳ２０４〜ステップＳ２０６の言い換え処理は、具体例１におけるステップＳ１０５〜ステップＳ１０７の処理と同様である。そして、具体例２では、出力部１２にて言い換え処理結果が表示された後、ユーザがその結果を用いて学習させたい場合は（ステップＳ２０７）、処理結果を修正する（ステップＳ２０８）。例えば、ユーザの習熟度が上がった場合、言い換えられた語句を元の単語（見出し語）に戻す、といった修正を行う。尚、このような場合は、出力結果中、元の単語と言い換え語を併記することが望ましい。
【００４４】
そして、ユーザがこの修正テキストにより習熟度学習を行いたい場合は、入力部１１より、このテキストを習熟度学習テキストとして入力する。これにより、習熟度学習部２３は、図１１に示す処理を行い、ユーザ情報格納テーブル３２に反映させる。
【００４５】
その後は、更に新しいテキストを処理するかを判断し（ステップＳ２０９）、新しいテキストを処理する場合は、ステップＳ２０４に戻り、そうでない場合は、ログアウトして（ステップＳ２１０）、処理を終了する。
【００４６】
〈効果〉
以上のように、具体例２によれば、習熟度学習のための文書中の単語が辞書データベース３１の見出し語や言い換え語に一致した場合、習熟度学習部２３によって、その単語を言い換え不要または言い換え要とする情報をユーザ情報格納テーブル３２に反映させるようにしたので、ユーザは、単語の言い換えの要否を一つ一つ登録しなくとも、文書を指定するだけでユーザ情報格納テーブル３２を容易にカスタマイズすることができる。
【００４７】
また、言い換え処理結果を修正し、習熟度の学習を行うようにしたので、容易かつ確実にユーザ情報格納テーブル３２のカスタマイズを行うことができる。
【００４８】
《具体例３》
具体例３は、言い換えシステムを、クライアント（入出力装置）と、サーバ（処理装置、データベース）とにより構成したものである。
【００４９】
〈構成〉
図１２は、具体例３の構成図である。
図示のシステムは、クライアント１００とサーバ２００とが通信回線を介して接続されることで実現されている。クライアント１００側には、入出力装置１０ａが設けられ、サーバ２００側には、処理装置２０ｂと記憶装置３０ａが設けられている。即ち、本具体例の全体の構成は、具体例１、２とほぼ同じであるが、ネットワークを介してテキストをクライアント１００とサーバ２００間でやり取りするようにした点が異なっている。
【００５０】
図１２において、入出力装置１０ａは、入力部１１、出力部１２、送受信部１３を備えている。ここで、入力部１１および出力部１２は具体例１、２の構成と同様である。また、送受信部１３は、サーバ２００とのデータのやり取りを行うためのクライアント１００側の送受信部である。
【００５１】
サーバ２００側の処理装置２０ｂは、変換部２２と送受信部２４を備えており、変換部２２は具体例１、２の構成と同様である。送受信部２４は、クライアント１００から送信されたテキストの受信を行って、これを形態素解析部２２１に送ったり、テキスト変換部２２３における言い換え処理結果をクライアント１００に送信するといった、サーバ２００側の送受信を行うための機能部である。
【００５２】
記憶装置３０ａは、基本的な構成は具体例１、２と同様であるが、辞書データベース３１のみ有している点が異なっている。
【００５３】
〈動作〉
図１３は、具体例３の動作を示すフローチャートである。
ユーザは、言い換え処理を行う場合、対象となるテキストと自身の習熟度の情報をサーバ２００に送信する（ステップＳ３０１）。ここで、ユーザ自身の習熟度とは、例えば、レベル１やレベル２といった情報である。
【００５４】
サーバ２００では、このような情報を送受信部２４が受け取ると、変換部２２が言い換え処理を行う（ステップＳ３０２）。尚、この言い換え処理については後述する。ステップＳ３０２で言い換え処理が行われると、送受信部２４は、その処理結果をクライアント１００に送信する（ステップＳ３０３）。これにより、クライアント１００側では、出力部１２にて処理結果を画面表示する（ステップＳ３０４）。そして、新しいテキストがある場合は（ステップＳ３０５）、ステップＳ３０１に戻って上述した処理を繰り返し、そうでない場合は言い換え処理を終了する。
【００５５】
図１４は、言い換え処理の動作を示すフローチャートである。
言い換え処理の流れは、具体例１、２とほぼ同じであるが、ユーザの習熟度情報をクライアント１００側で入力し、送信するようになっているので、この習熟度情報と辞書データベース３１の条件格納部３１３の情報とを比較して言い換えの要／不要を判定する点が異なっている。
【００５６】
先ず、テキストと共にユーザの習熟度情報を受け取ると、形態素解析部２２１にて、テキストを形態素解析する（ステップＳ３１１）。次に、辞書検索部２２２は、形態素解析結果から１語ずつ取り出して辞書データベース３１の辞書検索を行う（ステップＳ３１３）。そして、その単語が辞書登録されていた場合は、ユーザの習熟度レベルと辞書データベース３１に登録されている難易度レベル（条件格納部３１３の情報）とを比較し、単語の難易度がユーザの習熟度以上であるかを判定する（ステップＳ３１４）。このステップＳ３１４において、そうであった場合は、言い換え語をバッファに格納する（ステップＳ３１５）。一方、単語が辞書データベース３１に登録されていなかったり、単語の難易度がユーザの習熟度より低かった場合は、言い換えを行わずに、その単語をそのままバッファに格納する（ステップＳ３１６）。
【００５７】
そして、このような単語毎の処理を繰り返し、最後の単語が終了した場合（ステップＳ３１７）は、言い換え処理を終了する。
【００５８】
〈効果〉
以上のように、具体例３によれば、言い換え処理を行う処理装置２０ｂと辞書データベース３１をサーバ２００上に置き、クライアント１００からテキストと習熟度情報を与えるようにしたので、クライアント１００側の処理が軽くなり、クライアント１００側の記憶容量が少なくて済む、といった効果がある。これにより、携帯端末のように処理性能や記憶容量に制限のある装置に、言い換え処理システムを組み込むことが可能となる。
【００５９】
尚、上記具体例３において、具体例１、２と同様に、ユーザ情報格納テーブル３２をサーバ２００側に設け、クライアント１００側からユーザＩＤ等の識別情報を送信するように構成してもよい。
【００６０】
《具体例４》
具体例４は、見出し語の文脈上の関係に基づく条件に対応した言い換え語を辞書データベースに格納し、言い換え処理を行う場合は、その条件に対応した言い換え語を選択するようにしたものである。
【００６１】
〈構成〉
図１５は、具体例４の構成図である。
図示のシステムは、入出力装置１０、処理装置２０ｃ、記憶装置３０ａからなる。ここで、入出力装置１０および記憶装置３０の基本的な構成は、具体例１、２と同様であるため、その説明は省略する。処理装置２０ｃにおける変換部２５は、形態素解析部２２１、辞書検索部２２２、テキスト変換部２２３、条件照合部２２４からなる。尚、処理装置２０ｃ中、具体例１の言い換え情報設定部２１または具体例２の習熟度学習部２３を備えているが、その図示は省略している。
【００６２】
変換部２５は、見出し語の条件に応じて言い換え語を変換する機能を有し、形態素解析部２２１〜テキスト変換部２２３は、具体例１、２と同様である。条件照合部２２４は、辞書データベース３１ａ中の後述する第１、第２の条件に基づいて言い換え語を選択し、この選択情報をテキスト変換部２２３に渡す機能を有している。
【００６３】
辞書データベース３１ａは、辞書データベース３１ａとユーザ情報格納テーブル３２からなり、ユーザ情報格納テーブル３２は具体例１、２と同様である。辞書データベース３１ａは、見出し語に対する異なる条件に対応した言い換え語を備えたデータベースであり、次のように構成されている。
【００６４】
図１６は、具体例４の辞書データベース３１ａの説明図である。
具体例４の辞書データベース３１は、図示のように、見出し語格納部３１１、言い換え語格納部３１２、属性情報格納部３１４、第１の条件格納部３１５、第２の条件格納部３１６からなる。ここで、見出し語格納部３１１と言い換え語格納部３１２は、具体例１〜３の辞書データベース３１の構成と同様である。属性情報格納部３１４は、見出し語や言い換え語の意味や文脈上の関係を示す情報を格納する機能部である。また、第１の条件格納部３１５は、具体例１〜３における条件格納部３１３と同様の情報である第１の条件を格納する機能部である。更に、第２の条件格納部３１６は、見出し語の文脈上の関係に基づく条件、即ち、見出し語がどのような条件で使用されるかを示す情報である第２の条件を格納する機能部である。例えば、図中のＤ１６１は「コンフィ」の意味が「調理法」で、「酢漬け」と言い換えるための条件は、入力が「〜のコンフィ」という表現で、「〜」の部分に来る名詞の意味が「野菜」であることを示している。
【００６５】
〈動作〉
図１７は、具体例４の言い換え処理の動作を示すフローチャートである。
図１８は、入力テキストの一例を示す説明図である。
具体例４では、辞書検索を行う際に適用条件を調べ、条件にマッチした場合にのみ言い換え処理を行う（ステップＳ４０１〜ステップＳ４０７）。以下、図１８に示すテキストが入力された場合を例に説明する。
【００６６】
先ず、図１８におけるＴＸ１８１のテキストが入力されると、形態素解析部２２１により形態素解析を行い（ステップＳ４０１）、その結果を辞書検索部２２２によって１語ずつ辞書検索する（ステップＳ４０２、Ｓ４０３）。図１６に示す辞書データベース３１ａで「ポワブロン」を参照すると、第２の条件格納部３１６が空なので、言い換え語「ピーマン」をバッファに格納する（ステップＳ４０４、Ｓ４０５）。次に、「の」は辞書データベース３１ａに登録されていないので、その語句「の」をそのままバッファに格納する（ステップＳ４０４、Ｓ４０６）。
【００６７】
更に、「コンフィ」を検索すると（ステップＳ４０３）、Ｄ１６１とＤ１６２の２種類の言い換え候補があり、それぞれ第２の条件格納部３１６にこのデータを適用するための第２の条件が記述されている。ＴＸ１８１の場合、「コンフィ」の前に来る語が「ポワブロン」で、また、Ｄ１６３の属性情報格納部３１４の情報により、その意味は「野菜」であることが分かるのでＤ１６１が適用され、言い換え語「酢漬け」がバッファに格納される（ステップＳ４０４、Ｓ４０５）。
【００６８】
同様に、ＴＸ１８２のテキストが入力されると、今度は「コンフィ」の前に来る語が「アナナ」で、Ｄ１６４の属性情報格納部３１４の情報により、その意味は「果物」であることが分かるので、Ｄ１６２が適用され、言い換え語「砂糖漬け」がバッファに格納される（ステップＳ４０４、Ｓ４０５）。
【００６９】
図１９は、言い換え処理を行った後のバッファの内容を示す説明図である。
図示のように、ＴＸ１９１では「コンフィ」が「酢漬け」と言い換えられており、一方、ＴＸ１９２では、「コンフィ」が「砂糖漬け」と言い換えられている。
【００７０】
言い換え処理が終了すると、具体例１、２と同様に、バッファの内容を出力画面に表示し、新しいテキストの言い換え要求があれば、同様の処理を繰り返して行う。要求がなければ処理を終了する。
【００７１】
〈効果〉
以上のように、具体例４によれば、ある見出し語に対して異なる条件によって言い換え語を選択するようにしたので、複数の言い換えの可能性がある場合に、辞書データベース３１ａに記述された条件を参照して、最適な言い換え語を選択することができる。
【００７２】
《具体例５》
具体例５は、テキストに対して形態素解析を行うことなく、言い換え処理を行うようにしたものである。
【００７３】
〈構成〉
図２０は、具体例５の構成図である。
図示のシステムは、入出力装置１０、処理装置２０ｄ、記憶装置３０からなる。ここで、入出力装置１０および記憶装置３０の構成は、具体例１、２と同様であるため、その説明は省略する。処理装置２０ｄは、言い換え情報設定部２１と変換部２６からなり、言い換え情報設定部２１は、具体例１、２の構成と同様である。変換部２６は、辞書検索部２２２ａとテキスト変換部２２３からなり、テキスト変換部２２３は各具体例のテキスト変換部２２３と同様である。即ち、具体例５の変換部２６は、具体例１の変換部２２における形態素解析部２２１がない点が異なっている。また、辞書検索部２２２ａは、入力テキスト中の語いの先頭から１文字ずつ辞書データベース３１中の見出し語と照合し、マッチする文字列があった場合は、この見出し語の言い換え処理を行うよう構成されている。
【００７４】
〈動作〉
処理の流れも具体例１とほぼ同様であるが、言い換え処理の詳細が異なっている。
図２１は、具体例５の言い換え処理の動作を示すフローチャートである。
具体例５では、変換部２６の辞書検索部２２２ａは、入力されたテキストの１文字目からｍ文字の文字列の辞書検索を行う（ステップＳ５０１、Ｓ５０２、Ｓ５０３）。例えば、図５中のＴＸ５１の文を例に説明すると、「チ」「チョ」「チョリ」…のように、１文字目から始まる文字列をキーに辞書検索し（ステップＳ５０３）、見出し語とマッチする文字列があれば（ステップＳ５０４）、言い換え処理を行う（ステップＳ５０５、Ｓ５０６、Ｓ５０７）。尚、このとき、検索文字列の最長の値を設定するなどのことにより、検索回数を減らす工夫をしてもよい。また、複数の文字列にマッチした場合は、全ての文字列に対して言い換え処理を行う、あるいは、マッチした中で最長の文字列を選択する等の方法で処理を進める。
【００７５】
ＴＸ５１の例では、「チョリソ」がマッチするので、「チョリソ」の言い換え語である「辛口ソーセージ」をバッファに格納し（ステップＳ５０６）、ｎ＝１＋４とする（ステップＳ５０７）。ステップＳ５１２において、ｎが最後の文字ではないので、ステップＳ５０２に戻り、５文字目の「入」から辞書検索を行う（ステップＳ５０３）。「入」で始まる見出し語が辞書にないので（ステップＳ５０４、Ｓ５０８、Ｓ５０９）、「入」をバッファに格納する（ステップＳ５１０）。そして、ｎ＝ｎ＋１として（ステップＳ５１１）、最後の文字でない場合（ステップＳ５１２）は、ステップＳ５０２に戻る。
【００７６】
即ち、ステップＳ５０４において、見出し語が一致しなかった場合は、検索対象をｎ番目の文字から１文字ずつ増やし、これを最後の文字まで行い、それでも見出し語に一致しなかった場合は、ｎ番目の文字をバッファに格納するものである。
【００７７】
「入」がバッファに格納されると、次に「り」で始まる見出し語の辞書検索を行う（ステップＳ５０２、Ｓ５０３、Ｓ５０８、Ｓ５０９）。これも辞書にないので、「り」をバッファに格納する（ステップＳ５１０）。
以上のような処理を入力テキストの最後まで、繰り返し行う。
【００７８】
〈効果〉
以上のように、具体例５によれば、具体例１と比べて単語認識の精度は落ちるが、形態素解析を行わないため、処理が軽くなるという効果がある。
【００７９】
《利用形態》
上記各具体例では、言い換え対象のテキストとしてレストランのメニューを例として説明したが、これに限定されるものではなく、辞書データベース３１（３１ａ）の内容を変えるだけで、様々な分野に適用することができる。例えば、漢字をひらがなにしたり、読み仮名をふったりするシステムや、病院のカルテを患者に分かり易く言い換えるシステムといったことにも適用が可能である。
また、外来語（カタカナ語）と和語（漢字、ひらがな語）や、標準語と方言などを対応付けて登録することにより、文書作成システムや文書校正システムの一部に組み込んで、表現や用語体系の統一を図ることができる。
【００８０】
上記各具体例では、言い換え語の表示形態は、見出し語と置き換える方法について述べたが、見出し語と言い換え語を併記するようにしてもよいし、見出し語をマウス等で指定すると言い換え語が表示される、といった表示形態であってもよい。
また、言い換え語を下線付きで表示したり、表示色を変えるといった、言い換えていない箇所とは区別できるように、異なる表示を行ってもよい。
【００８１】
各具体例では、単語を言い換える例について説明したが、辞書検索を行う際、複数語による検索を行えるようにすれば、言い換えの対象がイディオムや熟語等、複数の単語からなる語句であっても構わない。
【００８２】
各具体例では、辞書データベース３１（３１ａ）とユーザ情報格納テーブル３２とを別体としたが、これらを一体のデータベースとして設けてもよい。また、ユーザ情報格納テーブル３２は、ユーザ毎に保有するようにしてもよい。
【００８３】
各具体例では、条件格納部３１３や第１の条件格納部３１５に格納される条件を１次元の値として説明したが、この条件は二つ以上の条件を組み合わせた複数次元のものであってもよい。例えば、条件の一つに難易度、もう一つに言い換え語の表記（ひらがな・カタカナ・漢字など）を記述しておく。これにより、見出し語を言い換える際に、ユーザ１はカタカナ語での言い換え語を優先するが、ユーザ２は和語（漢字）を優先するなどの処理が可能となる。
【００８４】
具体例２においては、単語毎の言い換え情報の登録について、言い換え処理結果を使って対話的に登録する方法について述べたが、必要な情報を記述したファイルから一括して登録するような手段を設けてもよい。
また、言い換え処理の結果をユーザが修正処理する場合、この処理に対するモニタリング手段を設け、ユーザが言い換えたい単語、あるいは言い換えたくない単語の情報を取得するようにしてもよい。
【００８５】
具体例４では、属性情報格納部３１４に単語の意味情報を格納した例を示したが、これに限らず、文法情報（その単語がとりうる構文の情報）や分野情報、字種情報等、種々の情報を格納することができる。
【００８６】
具体例５では、辞書データベース３１の構成として具体例１〜３の辞書データベース３１の構成としたが、具体例４の辞書データベース３１ａの構成とし、具体例４と同様の言い換え処理を行うようにしてもよい。
【００８７】
【発明の効果】
以上のように、本発明によれば、見出し語に対して、この見出し語を別の表現で表す言い換え語を格納する辞書データベースを設け、この辞書データベースを用いて任意の語句を言い換え語に変換するようにしたので、専門用語のように分かりにくい表現を分かり易い表現に言い換えることができる。
【図面の簡単な説明】
【図１】本発明の言い換えシステムの具体例１の構成図である。
【図２】具体例１の辞書データベースとユーザ情報格納テーブルの説明図である。
【図３】具体例１の動作を示すフローチャートである。
【図４】具体例１の言い換え処理の動作を示すフローチャートである。
【図５】文書の一例としてレストランのメニューを示す説明図である。
【図６】図５中のテキストの１文を形態素解析した結果の説明図である。
【図７】作業用バッファの内容の説明図である。
【図８】言い換え処理後のメニューの説明図である。
【図９】具体例２の構成図である。
【図１０】具体例２の動作を示すフローチャートである。
【図１１】習熟度学習処理を示すフローチャートである。
【図１２】具体例３の構成図である。
【図１３】具体例３の動作を示すフローチャートである。
【図１４】言い換え処理の動作を示すフローチャートである。
【図１５】具体例４の構成図である。
【図１６】具体例４の辞書データベースの説明図である。
【図１７】具体例４の言い換え処理の動作を示すフローチャートである。
【図１８】具体例４の入力テキストの説明図である。
【図１９】具体例４の作業用バッファの内容の説明図である。
【図２０】具体例５の構成図である。
【図２１】具体例５の言い換え処理の動作を示すフローチャートである。
【符号の説明】
１０、１０ａ入出力装置
２１言い換え情報設定部
２２、２５、２６変換部
２３習熟度学習部
３１、３１ａ辞書データベース
３２ユーザ情報格納テーブル[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a paraphrase system for translating text into paraphrases suitable for a user, and in particular, to understand foreign words and technical terms according to the level of the user (proficiency, possessed knowledge, physical limitations, etc.). It relates to a paraphrasing system that is easily paraphrased.
[0002]
[Prior art]
Conventionally, there has been a technique for switching a display screen according to a user's age, visual acuity, and proficiency (for example, see Patent Document 1).
In this document, the user sets the age so that the kanji to be used is restricted for elementary school children, and the display screen is changed, such as changing the size of characters to be used by performing eyesight measurement for elderly people. Performing techniques are disclosed.
[0003]
[Patent Document 1]
JP 2000-305746 A
[0004]
[Problems to be solved by the invention]
However, in the above-described conventional technology, the displayed content is uniform according to the level set by the system, and cannot be customized in detail according to the user's preference and proficiency.
[0005]
In recent years, in the field of computers and the like, foreign words (Katakana) and words that require specialized knowledge have been frequently used, and especially in the elderly, etc., there is a problem that understanding of sentences is hindered. is there. In such a case, a mechanism for paraphrasing difficult-to-understand expressions is necessary, but such a paraphrasing system has not been realized.
[0006]
[Means for Solving the Problems]
The present invention employs the following configuration to solve the above-described problem.
<Configuration 1>
A dictionary database that associates headwords with paraphrases that express the headwords in different expressions, and for any words and phrases, refers to the dictionary database, and if there is a matching headword, paraphrases the words and phrases A conversion system, comprising: a conversion unit that converts a word into a word and outputs the word.
[0007]
<Configuration 2>
In the paraphrase system according to Configuration 1, a user information storage table indicating whether or not a headword is paraphrased for each user, and, when a user is designated, an arbitrary information based on user information corresponding to the user. A paraphrase system comprising a conversion unit for converting a phrase into a paraphrase.
[0008]
<Configuration 3>
3. The paraphrasing system according to Configuration 2, further comprising a paraphrasing information setting unit that, when the user specifies whether to paraphrase the headword, reflects the specified content in the user information storage table.
[0009]
<Configuration 4>
In the paraphrasing system according to the configuration 2 or 3, when a word in the document provided by the user matches a headword in the dictionary database, a proficiency learning unit that sets the headword to be not paraphrased in the user information storage table. A paraphrasing system comprising:
[0010]
<Configuration 5>
In the paraphrase system according to the configuration 2 or 3, if the word in the document provided by the user matches the paraphrase in the dictionary database, the proficiency learning unit sets the headword as paraphrase required in the user information storage table. A paraphrasing system comprising:
[0011]
<Configuration 6>
In the paraphrase system according to any one of the constitutions 1 to 5, an input / output device connected via a communication line to the dictionary database and the conversion unit is provided, and the input / output device converts an arbitrary phrase together with a paraphrase request into the conversion unit. , And configured to receive the paraphrase result from the conversion unit.
[0012]
<Configuration 7>
In the paraphrase system according to any one of the constitutions 1 to 6, the dictionary database including the paraphrase corresponding to the condition based on the context relation of the headword, and the context relation of the phrase to an arbitrary phrase are shown. Based on the information, the dictionary database is referred to, and if there is a headword whose information satisfies the condition based on the contextual relationship, a conversion unit is provided that converts the headword into a paraphrase corresponding to the headword. A paraphrase system to do.
[0013]
<Configuration 8>
The paraphrasing system according to any one of Configurations 1 to 7, further comprising a conversion unit configured to morphologically analyze the input character string to extract words, and to convert a headword into a paraphrase based on the extracted words. Paraphrase system characterized by the following.
[0014]
<Configuration 9>
In the paraphrasing system according to any one of the constitutions 1 to 7, the character string is increased one character at a time from an arbitrary character in the document, and the character string is compared with a headword. A paraphrase system comprising a conversion unit that determines that the phrase is a phrase.
[0015]
<Configuration 10>
10. The paraphrasing system according to any one of Configurations 1 to 9, wherein a portion converted into a paraphrase is displayed differently from a portion not converted.
[0016]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, embodiments of the present invention will be described in detail using specific examples.
<< Specific Example 1 >>
<Constitution>
FIG. 1 is a configuration diagram showing a specific example 1 of the paraphrasing system of the present invention.
The illustrated system is configured by a computer, and includes an input / output device 10, a processing device 20, and a storage device 30. The input / output device 10 is a functional unit that inputs a document to be paraphrased and outputs a paraphrase result, and includes an input unit 11 and an output unit 12. The input unit 11 has a function of performing a text document input process such as a text input process using a keyboard or a pointing device such as a mouse, a text input process using a scanner and a character recognition processing unit, and a text input process using a microphone and a voice recognition process. ing. The input unit 11 has a function of inputting paraphrase information individually set by the user. The output unit 12 has a function of performing display on a display device, conversion into audio, audio output, and output to a file, and outputs a processing result of text input from the input unit 11.
[0017]
The processing device 20 includes an arithmetic device, a memory, a control unit, and the like, and has a function of executing a process of setting a user's proficiency level and rephrasing a text input from the input unit 11 into another expression. are doing.
The processing device 20 includes a paraphrase information setting unit 21 and a conversion unit 22. The paraphrase information setting unit 21 is a functional unit that sets the proficiency of each user input from the input unit 11 and stores paraphrase information of a specific word in a user information storage table 32 described later.
[0018]
The conversion unit 22 refers to a later-described dictionary database 31 and a user information storage table 32 in the storage device 30 for an arbitrary word in the text input from the input unit 11, and determines that there is a matching headword. Has a function of converting the phrase into a paraphrase based on the user information in the user information storage table 32, and includes a morphological analysis unit 221, a dictionary search unit 222, and a text conversion unit 223.
[0019]
The morphological analysis unit 221 is a processing unit that performs a known morphological analysis for performing a process of dividing a text into words. The dictionary search unit 222 is a functional unit that performs processing for acquiring information associated with a headword with reference to the dictionary database 31 for the words divided by the morphological analysis unit 221. The text conversion unit 223 is a functional unit that performs a process of paraphrasing text based on the dictionary search result of the dictionary search unit 222.
[0020]
The storage device 30 includes a storage device such as a hard disk device, an optical disk device, or a semiconductor memory, and includes a dictionary database 31 and a user information storage table 32. The dictionary database 31 is a database that stores paraphrase data. The user information storage table 32 is a table for storing user proficiency level information.
[0021]
FIG. 2 is an explanatory diagram of the dictionary database 31 and the user information storage table 32.
As shown in the figure, the dictionary database 31 includes a headword storage unit 311, a paraphrase storage unit 312, and a condition storage unit 313, and one record is formed by the information of the user information storage table 32 corresponding to these fields. . The headword storage unit 311 stores words (headwords) that require paraphrase. Further, the paraphrase storage unit 312 stores a phrase (paraphrase) in which each headword is paraphrased with another expression. Further, the condition storage unit 313 is information for setting a condition for applying the paraphrase processing, and in this specific example, indicates the difficulty of the headword. Here, the difficulty level indicates the degree of specialization of the word. In general, the higher the difficulty, the harder it is to understand, and a paraphrase is necessary. In this specific example, it is assumed that the proficiency level and the difficulty level correspond to each other. Further, the user information storage table 32 indicates, for each piece of identification information (ID) of each user, information on whether or not to paraphrase each headword.
[0022]
<motion>
FIG. 3 is a flowchart showing the operation of the first embodiment.
First, a user using the present apparatus performs a user authentication process by inputting a user ID or the like from the input unit 11 (step S101). Then, selection is made as to whether to set paraphrase information or to perform paraphrase processing (step S102). In step S102, in the case of "Y", the operation of the conversion unit 22 is performed, and in the case of "N", the operation of the paraphrase information setting unit 21 is performed.
[0023]
1) When setting paraphrase information (in the case of "N" in step S102)
The setting of the paraphrase information includes a method of designating the overall level according to the criteria of the device, and a method of setting whether or not it is necessary to paraphrase each word.
[0024]
1-1) First, a method of setting paraphrase information by specifying the overall level will be described. Here, it is assumed that level 1 is defined as beginner, level 2 as intermediate, and level 3 as advanced.
[0025]
When the user selects and inputs his / her own level through the input / output device 10, the paraphrase information setting unit 21 sets this information in the user information storage table 32. For example, when the user selects level 2 (step S103), words having a difficulty level of level 2 or higher, that is, words of level 2 and level 3 are to be paraphrased, and the paraphrase information setting unit 21 sets the headwords of these levels. Is added to the field of the user information storage table 32 corresponding to the information (step S104). In FIG. 2, the user information storage table 32 of ID1 shows a state where the user of ID1 has selected “intermediate” = level 2.
[0026]
1-2) Next, a method of setting paraphrase information for each word will be described.
There are cases where it is desired or not desired to paraphrase a specific word regardless of the level of the user. In such a case, a word and information on whether to paraphrase the word are input through the input / output device 10 (step S103). Thereby, the paraphrase information setting unit 21 gives information on whether or not to paraphrase the word to the user information storage table 32 of the specified word (step S104). In FIG. 2, the user information storage table 32 of ID2 is a word of level 2 after the user of ID2 selects “intermediate” = level 2 and paraphrases the word of “level 1” “choriso”. This shows a state in which “Confid” is set to be unnecessary.
[0027]
2) When performing paraphrasing processing (in the case of “Y” in step S102)
When performing the paraphrasing process, the user inputs a text to be paraphrased from the input unit 11 (step S105). Thereby, the paraphrasing process is performed on the text in the conversion unit 22 (step S106), and the processing result is output from the output unit 12 by means such as a screen display (step S107). Thereafter, when processing a newer text (step S108), the process returns to step S105. Otherwise, the user logs out (step S109) and ends the process.
[0028]
Next, the operation of the paraphrasing process in step S106 will be described in detail.
FIG. 4 is a flowchart illustrating the operation of the paraphrasing process.
First, the morphological analysis unit 221 performs a morphological analysis on the input text (step S111). Next, the dictionary search unit 222 extracts one word from the morphological analysis result by the morphological analysis unit 221 (step S112), and searches the dictionary database 31 using the word as a key (step S113). If the word is not registered in the dictionary database 31, the word is stored in a buffer (not shown) in the conversion unit 22 (steps S114 and S115). If the word is registered in the dictionary database 31 as a headword, the corresponding paraphrase is stored in the buffer according to the information in the user information storage table 32 of the user (step S116). Next, it is determined whether the word extracted in step S112 is the last word of the text (step S117). If the word is not the last word, the process returns to step S112, and the above-described steps S112 to S115 (or step S116). Is repeated.
[0029]
FIG. 5 is a diagram illustrating a restaurant menu as an example of a document.
Hereinafter, the flow of the process when the user of ID2 performs the paraphrase process using this text will be specifically described.
[0030]
FIG. 6 is an explanatory diagram of the result of morphological analysis of one sentence of the text in FIG.
FIG. 7 is an explanatory diagram of the contents of the work buffer.
FIG. 8 is an explanatory diagram of the menu after the paraphrasing process.
[0031]
When performing the paraphrasing process, first, "chorizo" is read from the morphological analysis result (TX61) in FIG. Next, as shown in FIG. 2, when the dictionary database 31 is searched, it can be seen that the field of "chorizo" of ID2 is set to "o", that is, that paraphrase is required. Therefore, "dry sausage" which is a paraphrase of "chorizo" is stored in the buffer. That is, TX71 “dry sausage” in FIG. 7 is stored in the buffer.
[0032]
Since the user of ID2 has the proficiency level of 2, the word "chorizo" is a word of a difficulty level lower than the proficiency level of the user, but the paraphrase is performed by giving priority to the information in the user information storage table 32. Will be.
[0033]
Next, “enter” is read from the morphological analysis result shown in FIG. 6, and the dictionary database 31 is similarly searched. Since "enter" is not registered in the dictionary database 31, "enter" is registered in the buffer as it is (the state of TX72 in FIG. 7). Then, the same processing is performed on “Fu beans” to “Confi”.
[0034]
TX73 in FIG. 7 indicates the contents of the buffer at the time when the paraphrasing process for the sentence TX61 in FIG. 6 is completed. That is, paraphrase processing is performed on “lagou” and “poiblond” in the TX 61, and “confid” is output as it is because the user with ID 2 does not need to paraphrase. FIG. 8 shows the result of performing such a paraphrasing process on all the sentences in the menu shown in FIG. When the paraphrasing process is completed for all the sentences of the input text, the text conversion unit 223 sends the processing result to the output unit 12, and the output unit 12 displays the processing result on a screen or the like.
[0035]
<effect>
As described above, according to the specific example 1, the dictionary database 31 indicating the paraphrase for the headword is provided, and the vocabulary in the input sentence is converted into the paraphrase based on the dictionary database 31. Expressions that are difficult to understand, such as technical terms, can be paraphrased into expressions that are easy to understand according to the user's proficiency.
[0036]
Further, since the user individually sets whether or not to paraphrase each word, fine customization can be performed according to the user's preference and proficiency.
[0037]
<< Specific Example 2 >>
In the specific example 2, the user inputs a document indicating the proficiency level, and based on the document, sets whether or not the paraphrase is necessary in the user information storage table.
[0038]
<Constitution>
FIG. 9 is a configuration diagram of the specific example 2.
The illustrated system includes an input / output device 10, a processing device 20a, and a storage device 30. Here, the configurations of the input / output device 10 and the storage device 30 are the same as those in the first embodiment, and thus description thereof will be omitted. The processing device 20a includes a conversion unit 22 and a proficiency learning unit 23. That is, in the specific example 2, a skill learning unit 23 is provided in place of the paraphrase information setting unit 21 in the specific example 1. Here, since the conversion unit 22 is the same as that of the first embodiment, the description thereof is omitted. In addition, when the phrase in the text given by the user matches the headword in the dictionary database 31, the proficiency learning unit 23 sets the paraphrase unnecessary in the user information storage table 32, and the phrase becomes a paraphrase. If they match, a function to set that paraphrase is necessary is provided.
[0039]
<motion>
FIG. 10 is a flowchart illustrating the operation of the second embodiment.
Also in the specific example 2, the user performs a user authentication process by a method such as inputting a user ID from the input unit 11 (step S201), and selects whether to learn the proficiency level or perform a paraphrase process (step S201). S202). In step S202, in the case of "Y", the operation of the conversion unit 22 is performed, and in the case of "N", the operation of the proficiency learning unit 23 is performed.
[0040]
1) When learning the proficiency (in the case of "N" in step S202)
When the user inputs a text from the input unit 11, the proficiency learning unit 23 performs a proficiency learning process (Step S203). Here, the text input by the user may be a text created by the user or a web page on the Internet. In other words, terms used in texts created by the user and texts that the user can read and understand can be considered to be words acquired by the user. From such a viewpoint, the proficiency learning process of the target user is performed.
[0041]
FIG. 11 is a flowchart illustrating the flow of the proficiency learning process.
When performing the proficiency learning process, first, the input text is subjected to morphological analysis (step S221). Next, the words of the morphological analysis result are searched in a dictionary one by one, and it is checked whether or not the words are registered in the dictionary database 31 (steps S222 and S223). If the word is registered in the paraphrase storage unit 312 of the dictionary database 31, information “O” indicating the need for paraphrase is stored in a corresponding column of the user information storage table 32 (steps S224 and S225). Shift to the processing of the word (step S226). On the other hand, if the word is registered in the headword storage unit 311, information “x” indicating that paraphrase is unnecessary is stored in a corresponding column of the user information storage table 32 (steps S 224 and S 225), and the next word is stored. (Step S226). If it is not registered in any of the headword storage unit 311 and the paraphrase storage unit 312 in the dictionary database 31, the process proceeds to the next word without performing any processing (steps S224 and S226). When the processing of steps S211 to S225 is completed for all the words in the input text, the proficiency learning processing ends.
[0042]
In addition, for data in which the paraphrasing / unnecessary information in the user information storage table 32 is blank, according to a predefined rule (for example, “O” or “X” is collectively assigned according to the level of the condition storage unit 313). ) Paraphrasing / unnecessary information may be added.
[0043]
2) When performing paraphrase processing (in the case of "Y" in step S202)
In the specific example 2, the paraphrasing processing of steps S204 to S206 is the same as the processing of steps S105 to S107 in the specific example 1. Then, in the specific example 2, after the paraphrase processing result is displayed on the output unit 12, if the user wants to learn using the result (step S207), the processing result is corrected (step S208). For example, when the user's proficiency increases, a correction is made such that the paraphrased word is returned to the original word (headword). In such a case, it is desirable to write the original word and the paraphrase in the output result.
[0044]
Then, when the user wants to perform proficiency learning using the corrected text, the user inputs this text as a proficiency learning text from the input unit 11. Thereby, the proficiency learning unit 23 performs the process shown in FIG. 11 and reflects the process on the user information storage table 32.
[0045]
Thereafter, it is determined whether to process a new text (step S209). If the new text is to be processed, the process returns to step S204. If not, the user logs out (step S210) and ends the process.
[0046]
<effect>
As described above, according to the specific example 2, when a word in a document for learning proficiency matches a headword or paraphrase in the dictionary database 31, the proficiency learning unit 23 does not need to paraphrase the word. Since the information necessary for paraphrase is reflected in the user information storage table 32, the user does not need to register the necessity of word paraphrase one by one. Can be easily customized.
[0047]
Further, since the paraphrase processing result is corrected and the learning of the proficiency level is performed, the user information storage table 32 can be easily and reliably customized.
[0048]
<< Specific Example 3 >>
In Example 3, the paraphrasing system is configured by a client (input / output device) and a server (processing device, database).
[0049]
<Constitution>
FIG. 12 is a configuration diagram of the third embodiment.
The illustrated system is realized by connecting a client 100 and a server 200 via a communication line. The input / output device 10a is provided on the client 100 side, and the processing device 20b and the storage device 30a are provided on the server 200 side. That is, the overall configuration of this example is substantially the same as in Examples 1 and 2, except that text is exchanged between the client 100 and the server 200 via the network.
[0050]
12, the input / output device 10a includes an input unit 11, an output unit 12, and a transmission / reception unit 13. Here, the input unit 11 and the output unit 12 are the same as those in the first and second embodiments. The transmission / reception unit 13 is a transmission / reception unit on the client 100 side for exchanging data with the server 200.
[0051]
The processing device 20b on the server 200 side includes a conversion unit 22 and a transmission / reception unit 24, and the conversion unit 22 has the same configuration as the first and second embodiments. The transmission / reception unit 24 performs transmission / reception on the server 200 side, such as receiving the text transmitted from the client 100 and transmitting the text to the morphological analysis unit 221 or transmitting the paraphrase processing result in the text conversion unit 223 to the client 100. It is a functional part for performing.
[0052]
The storage device 30a has the same basic configuration as those of the first and second embodiments, but differs in that it has only the dictionary database 31.
[0053]
<motion>
FIG. 13 is a flowchart illustrating the operation of the third embodiment.
When performing the paraphrasing process, the user transmits the target text and information on his / her proficiency level to the server 200 (step S301). Here, the user's own proficiency is, for example, information such as level 1 and level 2.
[0054]
In the server 200, when the transmission / reception unit 24 receives such information, the conversion unit 22 performs a paraphrase process (step S302). This paraphrasing process will be described later. When the paraphrasing process is performed in step S302, the transmission / reception unit 24 transmits the processing result to the client 100 (step S303). Thereby, on the client 100 side, the processing result is displayed on the screen by the output unit 12 (step S304). If there is a new text (step S305), the process returns to step S301 and repeats the above-described processing. Otherwise, the paraphrasing processing ends.
[0055]
FIG. 14 is a flowchart illustrating the operation of the paraphrasing process.
The flow of the paraphrase process is almost the same as in the first and second embodiments, but the user's proficiency information is input and transmitted on the client 100 side. The difference is that the necessity / unnecessity of paraphrase is determined by comparing the information of the storage unit 313.
[0056]
First, when the user's proficiency level information is received together with the text, the morphological analysis unit 221 performs morphological analysis on the text (step S311). Next, the dictionary search unit 222 extracts a word at a time from the morphological analysis result and performs a dictionary search of the dictionary database 31 (step S313). If the word is registered in the dictionary, the user's proficiency level is compared with the difficulty level registered in the dictionary database 31 (information in the condition storage unit 313), and the difficulty of the word is determined by the user. It is determined whether the proficiency level is higher than the proficiency level (step S314). If this is the case in step S314, the paraphrase is stored in the buffer (step S315). On the other hand, if the word is not registered in the dictionary database 31 or the difficulty of the word is lower than the user's proficiency, the word is stored in the buffer as it is without performing paraphrase (step S316).
[0057]
Then, such processing for each word is repeated, and when the last word ends (step S317), the paraphrasing processing ends.
[0058]
<effect>
As described above, according to the third embodiment, the processing device 20b for performing the paraphrasing process and the dictionary database 31 are placed on the server 200, and the text and the proficiency information are provided from the client 100. And the storage capacity of the client 100 side can be reduced. As a result, the paraphrasing processing system can be incorporated into an apparatus having a limited processing performance and storage capacity, such as a portable terminal.
[0059]
Note that, in the specific example 3, as in the specific examples 1 and 2, the user information storage table 32 may be provided on the server 200 side, and the client 100 may transmit identification information such as a user ID.
[0060]
<< Specific Example 4 >>
In Example 4, a paraphrase corresponding to a condition based on the contextual relationship of a headword is stored in a dictionary database, and when performing paraphrase processing, a paraphrase corresponding to the condition is selected. .
[0061]
<Constitution>
FIG. 15 is a configuration diagram of the specific example 4.
The illustrated system includes an input / output device 10, a processing device 20c, and a storage device 30a. Here, the basic configurations of the input / output device 10 and the storage device 30 are the same as those of the first and second embodiments, and thus the description thereof is omitted. The conversion unit 25 in the processing device 20c includes a morphological analysis unit 221, a dictionary search unit 222, a text conversion unit 223, and a condition matching unit 224. Although the processing device 20c includes the paraphrase information setting unit 21 of the specific example 1 or the proficiency learning unit 23 of the specific example 2, the illustration is omitted.
[0062]
The conversion unit 25 has a function of converting a paraphrase according to the condition of a headword, and the morphological analysis unit 221 to the text conversion unit 223 are the same as those in the first and second specific examples. The condition matching unit 224 has a function of selecting a paraphrase based on first and second conditions described later in the dictionary database 31a and passing the selected information to the text conversion unit 223.
[0063]
The dictionary database 31a includes a dictionary database 31a and a user information storage table 32. The user information storage table 32 is similar to the first and second examples. The dictionary database 31a is a database provided with paraphrases corresponding to different conditions for headwords, and is configured as follows.
[0064]
FIG. 16 is an explanatory diagram of the dictionary database 31a of the specific example 4.
As shown, the dictionary database 31 of the specific example 4 includes a headword storage unit 311, a paraphrase word storage unit 312, an attribute information storage unit 314, a first condition storage unit 315, and a second condition storage unit 316. Here, the headword storage unit 311 and the paraphrase storage unit 312 have the same configuration as the dictionary database 31 of the first to third specific examples. The attribute information storage unit 314 is a function unit that stores information indicating the meaning of the headword or paraphrase or the relation in context. The first condition storage unit 315 is a functional unit that stores the first condition, which is the same information as the condition storage unit 313 in the first to third examples. Further, the second condition storage unit 316 stores a condition based on the contextual relationship of the headword, that is, a second condition that is information indicating under what condition the headword is used. It is. For example, D161 in the figure means that the meaning of "confit" is "cooking method", and the condition for rephrasing "pickled in pickles" is that the input is the expression "configuration of ~" Is a "vegetable".
[0065]
<motion>
FIG. 17 is a flowchart illustrating the operation of the paraphrase processing of the fourth example.
FIG. 18 is an explanatory diagram illustrating an example of the input text.
In the specific example 4, application conditions are checked when performing a dictionary search, and paraphrase processing is performed only when the conditions are matched (steps S401 to S407). Hereinafter, a case where the text shown in FIG. 18 is input will be described as an example.
[0066]
First, when the text of the TX 181 in FIG. 18 is input, the morphological analysis is performed by the morphological analysis unit 221 (step S401), and the result is dictionary-searched one by one by the dictionary search unit 222 (steps S402 and S403). Referring to "Poiblon" in the dictionary database 31a shown in FIG. 16, since the second condition storage section 316 is empty, the paraphrase "Pepper" is stored in the buffer (steps S404 and S405). Next, since "no" is not registered in the dictionary database 31a, the word "no" is stored in the buffer as it is (steps S404 and S406).
[0067]
Further, when "Confid" is searched (step S403), there are two types of paraphrase candidates, D161 and D162, and the second condition for applying this data is described in the second condition storage unit 316. . In the case of TX181, since the word preceding "Confi" is "Poireblon", and the meaning of the word is "vegetable" from the information in the attribute information storage unit 314 of D163, D161 is applied. "Pickled" is stored in the buffer (steps S404, S405).
[0068]
Similarly, when the text of TX182 is input, the word preceding “Confi” is “anana”, and the meaning of the word is “fruit” from the information in the attribute information storage unit 314 of D164. Therefore, D162 is applied, and the paraphrase "candied" is stored in the buffer (steps S404 and S405).
[0069]
FIG. 19 is an explanatory diagram showing the contents of the buffer after performing the paraphrasing process.
As shown, in TX191, "confi" is paraphrased as "pickled", while in TX192, "confi" is paraphrased as "candied".
[0070]
When the paraphrasing process is completed, the contents of the buffer are displayed on the output screen in the same manner as in the first and second examples, and if there is a new text paraphrasing request, the same process is repeated. If there is no request, the process ends.
[0071]
<effect>
As described above, according to the specific example 4, the paraphrase is selected under a different condition for a certain headword, so if there is a possibility of a plurality of paraphrases, the condition described in the dictionary database 31a is used. , An optimal paraphrase can be selected.
[0072]
<< Specific Example 5 >>
In Example 5, the paraphrasing process is performed without performing morphological analysis on the text.
[0073]
<Constitution>
FIG. 20 is a configuration diagram of the specific example 5.
The illustrated system includes an input / output device 10, a processing device 20d, and a storage device 30. Here, since the configurations of the input / output device 10 and the storage device 30 are the same as those of the first and second embodiments, the description thereof will be omitted. The processing device 20d includes a paraphrase information setting unit 21 and a conversion unit 26. The paraphrase information setting unit 21 has the same configuration as that of the first and second specific examples. The conversion unit 26 includes a dictionary search unit 222a and a text conversion unit 223, and the text conversion unit 223 is the same as the text conversion unit 223 of each specific example. That is, the conversion unit 26 of the specific example 5 is different from the conversion unit 22 of the specific example 1 in that the morphological analysis unit 221 is not provided. In addition, the dictionary search unit 222a checks each character from the beginning of the vocabulary in the input text with the headword in the dictionary database 31 and, if there is a matching character string, performs the paraphrase processing of the headword. It is configured.
[0074]
<motion>
The flow of the processing is almost the same as that of the first embodiment, but the details of the paraphrasing processing are different.
FIG. 21 is a flowchart illustrating the operation of the paraphrasing process of the specific example 5.
In the specific example 5, the dictionary search unit 222a of the conversion unit 26 performs a dictionary search for a character string of m characters from the first character of the input text (steps S501, S502, and S503). For example, using the sentence TX51 in FIG. 5 as an example, a dictionary search is performed using a character string starting from the first character as a key, such as “Chi”, “Cho”, “Chori”, etc. (step S503). If there is a matching character string (step S504), paraphrase processing is performed (steps S505, S506, S507). At this time, the number of searches may be reduced by setting the longest value of the search character string. When a plurality of character strings are matched, the processing is performed by performing a paraphrasing process on all the character strings or selecting the longest character string among the matched character strings.
[0075]
In the example of TX51, since "chorizo" matches, "dry sausage" which is a paraphrase of "chorizo" is stored in the buffer (step S506), and n = 1 + 4 (step S507). In step S512, since n is not the last character, the process returns to step S502, and a dictionary search is performed from the fifth character “ON” (step S503). Since there is no headword starting with "ON" in the dictionary (steps S504, S508, S509), "ON" is stored in the buffer (step S510). Then, n = n + 1 (step S511). If the character is not the last character (step S512), the process returns to step S502.
[0076]
That is, in step S504, if the headword does not match, the search target is increased by one character from the nth character, and the search is performed up to the last character. If the headword still does not match, the nth character is not searched. Is stored in the buffer.
[0077]
When "ON" is stored in the buffer, a dictionary search for a headword starting with "RI" is performed (steps S502, S503, S508, S509). Since this is also not in the dictionary, "RI" is stored in the buffer (step S510).
The above processing is repeated until the end of the input text.
[0078]
<effect>
As described above, according to the specific example 5, although the accuracy of word recognition is lower than that of the specific example 1, the morphological analysis is not performed, so that there is an effect that the processing is reduced.
[0079]
《Usage form》
In each of the above specific examples, a restaurant menu has been described as an example of the text to be paraphrased, but the present invention is not limited to this, and is applicable to various fields only by changing the contents of the dictionary database 31 (31a). Can be. For example, the present invention can be applied to a system for changing a kanji to hiragana or a reading kana, or a system for rephrasing a hospital chart so that a patient can easily understand it.
In addition, by registering foreign words (Katakana) and Japanese (Kanji, Hiragana) or standard languages and dialects in association, they can be incorporated into a part of the document creation system or document proofreading system to express expressions and terms. The system can be unified.
[0080]
In each of the above specific examples, the display form of the paraphrase is described as a method of replacing the headword with the headword. However, the headword and the paraphrase may be described together, or the paraphrase is displayed when the headword is designated with a mouse or the like. May be displayed.
In addition, a different display may be performed so that the paraphrase is displayed with an underline or the display color is changed so as to be distinguished from a non-paraphrase.
[0081]
In each specific example, an example in which words are paraphrased has been described.However, when performing a dictionary search, if a search with multiple words can be performed, even if the target of paraphrase is a word composed of a plurality of words, such as idioms and idioms I do not care.
[0082]
In each specific example, the dictionary database 31 (31a) and the user information storage table 32 are provided separately, but these may be provided as an integrated database. The user information storage table 32 may be held for each user.
[0083]
In each specific example, the condition stored in the condition storage unit 313 or the first condition storage unit 315 has been described as a one-dimensional value. However, this condition is a multidimensional one obtained by combining two or more conditions. Is also good. For example, one of the conditions describes the difficulty level, and the other describes the notation of the paraphrase (such as hiragana, katakana, or kanji). Thus, when paraphrasing the headword, the user 1 can prioritize the paraphrase in katakana, while the user 2 can perform processing such as prioritizing Japanese (kanji).
[0084]
In the specific example 2, the method of registering the paraphrase information for each word interactively using the paraphrase processing result has been described. However, a means for registering the necessary information collectively from a file in which necessary information is described is provided. You may.
When the user corrects the result of the paraphrasing process, a monitoring unit for this process may be provided to acquire information on a word that the user wants to paraphrase or a word that the user does not want to paraphrase.
[0085]
In the specific example 4, an example in which the semantic information of the word is stored in the attribute information storage unit 314 is shown. However, the present invention is not limited to this. Various information can be stored.
[0086]
In the specific example 5, the configuration of the dictionary database 31 of the specific examples 1 to 3 was used as the configuration of the dictionary database 31. Is also good.
[0087]
【The invention's effect】
As described above, according to the present invention, for a headword, a dictionary database for storing a paraphrase for expressing the headword in another expression is provided, and an arbitrary phrase is converted to a paraphrase using this dictionary database. As such, it is possible to rephrase difficult-to-understand expressions such as technical terms into easy-to-understand expressions.
[Brief description of the drawings]
FIG. 1 is a configuration diagram of a specific example 1 of a paraphrasing system of the present invention.
FIG. 2 is an explanatory diagram of a dictionary database and a user information storage table of a specific example 1.
FIG. 3 is a flowchart illustrating an operation of a specific example 1.
FIG. 4 is a flowchart illustrating an operation of a paraphrasing process of a specific example 1.
FIG. 5 is an explanatory diagram showing a restaurant menu as an example of a document.
FIG. 6 is an explanatory diagram of a result of morphological analysis of one sentence of the text in FIG. 5;
FIG. 7 is an explanatory diagram of the contents of a work buffer.
FIG. 8 is an explanatory diagram of a menu after the paraphrasing process.
FIG. 9 is a configuration diagram of a specific example 2.
FIG. 10 is a flowchart illustrating an operation of a specific example 2;
FIG. 11 is a flowchart showing proficiency learning processing.
FIG. 12 is a configuration diagram of a specific example 3.
FIG. 13 is a flowchart illustrating an operation of a specific example 3.
FIG. 14 is a flowchart showing an operation of a paraphrase process.
FIG. 15 is a configuration diagram of a specific example 4.
FIG. 16 is an explanatory diagram of a dictionary database of a specific example 4.
FIG. 17 is a flowchart illustrating the operation of the paraphrasing process of Example 4;
FIG. 18 is an explanatory diagram of an input text of a specific example 4.
FIG. 19 is an explanatory diagram of the contents of a work buffer according to a specific example 4.
FIG. 20 is a configuration diagram of a specific example 5;
FIG. 21 is a flowchart showing the operation of the paraphrase processing of the specific example 5.
[Explanation of symbols]
10, 10a I / O device
21 Paraphrase information setting section
22, 25, 26 conversion unit
23 Proficiency Learning Department
31, 31a Dictionary database
32 User information storage table

Claims

A dictionary database that associates the headword with a paraphrase that expresses the headword in another expression,
A paraphrase system comprising: a conversion unit that refers to the dictionary database for an arbitrary phrase and, when there is a matching headword, converts the phrase into a paraphrase and outputs the paraphrase.

The paraphrasing system according to claim 1,
A user information storage table for each user indicating whether or not to paraphrase the headword,
A paraphrase system, comprising: a conversion unit that converts an arbitrary phrase into a paraphrase based on user information corresponding to the user when the user is designated.

In the paraphrase system according to claim 2,
A paraphrase system comprising: a paraphrase information setting unit that, when a user specifies whether or not to paraphrase a headword, reflects the specified content in the user information storage table.

In the paraphrase system according to claim 2 or 3,
A paraphrase characterized in that when the word in the document given by the user matches a headword in the dictionary database, the user information storage table includes a proficiency learning unit that sets the headword to be not paraphrased. system.

In the paraphrase system according to claim 2 or 3,
If the word in the document given by the user matches the paraphrase in the dictionary database, the paraphrase characterized in that the user information storage table is provided with a proficiency learning unit that sets the headword as paraphrase required. system.

In the paraphrase system according to any one of claims 1 to 5,
An input / output device connected to the dictionary database and the conversion unit via a communication line is provided,
A paraphrase system, wherein the input / output device is configured to transmit an arbitrary phrase together with a paraphrase request to the conversion unit, and to receive a paraphrase result from the conversion unit.

In the paraphrase system according to any one of claims 1 to 6,
A dictionary database with paraphrases corresponding to conditions based on the contextual relationship of the headword,
For any phrase, based on information indicating the contextual relationship of the phrase, refer to the dictionary database, and if there is a headword whose information satisfies the condition based on the contextual relationship, A paraphrase system comprising a conversion unit for converting a paraphrase corresponding to a headword.

In the paraphrase system according to any one of claims 1 to 7,
A paraphrase system comprising: a conversion unit configured to morphologically analyze an input character string to extract words and convert a headword into a paraphrase based on the extracted words.

In the paraphrase system according to any one of claims 1 to 7,
A character string is added one character at a time from an arbitrary character in the document, the character string is compared with a headword, and when a match is found, a conversion unit that determines that the character string is a phrase to be paraphrased is provided. Paraphrase system characterized by the following.

In the paraphrase system according to any one of claims 1 to 9,
A paraphrase system characterized in that a portion converted into a paraphrase is displayed differently from a portion not converted.