Published online by Cambridge University Press: 05 June 2015
As discussed in Chapter 2, in most sentiment analysis applications, one needs to study opinions from many people because due to the subjective nature of opinions, looking at only the opinion from a single person is usually not sufficient. To understand a large number of opinions, some form of summary is necessary. Definition 2.14 in Section 2.2 defined a structured opinion summary called aspect-based summary, also known as feature-based summary in Hu and Liu (2004) and Liu et al. (2005). Much of the opinion summarization research is based on this definition. This form of summary is also widely used in industry. For example, both Microsoft Bing and Google Product Search use this form of summary in their opinion analysis systems.
In general, opinion summarization can be seen as a kind of multidocument text summarization. Traditional text summarization has been studied extensively in NLP (Das, 2007). However, an opinion summary is quite different from a conventional single document or multidocument summary (of factual information). The reason is that an opinion summary should (1) be centered on entities and aspects and sentiments about them and (2) be quantitative. Traditional single document summarization produces a short document from a long document by extracting some “important” sentences, while traditional multidocument summarization finds differences among documents and discards repeated information. Neither of them explicitly captures different topics/entities and their aspects discussed in the documents, nor do they have a quantitative perspective. The “importance” of a sentence in traditional text summarization is typically defined operationally based on the summarization algorithms and measures used in each system. Opinion summary, on the other hand, can be defined formally in a structured form and represented as structured objects (see Definition 2.14). Even for output opinion summaries that are short text documents, there should still be explicit structures in them.
After discussing summarization, we move the topic of opinion search or retrieval in this chapter.
To save this book to your Kindle, first ensure no-reply@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Find out more about the Kindle Personal Document Service.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Dropbox.
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to Google Drive.