Many companies and research agencies are conducting various surveys to research customer satisfaction and market trends, in order to become more customer-oriented. Surveys generally include multiple-choice questions (providing quantitative results) and boxes to be freely filled in, (providing qualitative results). Given the increasingly diverse preferences of customers, the qualitative information obtained from comments written in the fill-in boxes and those posted on websites is becoming more important. However, to analyze this information, the person in charge must review the comments one by one and understand the intended meaning. This process not only requires a considerable amount of time, but also causes the results to vary according to each person's experience and knowledge when analyzed by multiple people.
To solve this problem, Fuji Xerox is researching and developing natural language processing technology that can effectively collect and analyze the text data written in surveys and other studies by using computer analysis in addition to human interpretation.
By using this technology, a massive amount of text data can be automatically categorized and organized in the form of comprehensive numerical values and charts as indicated below. These charts provide a quantitative overview and enable trends and changes in the entire market to be easily communicated to others. The quantitative/visual data that is automatically generated also enables the analyst to concentrate more thoroughly on analysis.
The following describes this technology by using a usage example.

Example of a survey: "Tell us your thoughts about how to use cell-phones."
Flow of the analysis is described below.

![]()
Fig. 1, "Topic Graph," Fig. 2, "Percentage Difference Graph (with numerical values)," and Table 1, "Modification Frequency," show meaning chunks contained in the database. From these figures and table, analysts can obtain the information described below.


