| Jaap Kamps | Jussi Karlgren | Ralf Schenkel |
| University of Amsterdam | SICS, Stockholm | MPI, Saarland University |

The 3d ESAIR workshop was held on October 30, 2010 in Toronto in conjunction with the 19th CIKM conference. The topic of the workshop was semantic annotations by which we refer to linguistic annotations (such as named entities, semantic classes or roles, etc.) as well as user annotations (such as microformats, RDF, tags, etc.) and other related information additions to textual or other objects. With the advent of several new and robust analysis tools for text, speech, and video; more structured data sources on the web, thanks to modern web languages; large numbers of services which provide support for user annotation and tagging we can expect new application areas for this new, deeper, and enhanced analysis of information. The CIKM conference keynotes by Jamie Callan ("Search engine support for software applications"), Gregory Grefenstette ("Use of semantics in real life applications"), and Sue Dumais ("Temporal dynamics and information retrieval") touched upon this issue, as did several presentations and posters at the conference. We can expect the launch of new tools and applications to exploit semantic cues profitably in the near future.
The aim of this workshop was not to discuss the craft of semantic annotation itself, but rather the applications of semantic annotation to information access tasks on various levels of abstraction such as ad-hoc retrieval, classification, browsing, textual mining, summarization, question answering, etc. The program included two keynote talks, seventeen poster presentations, three break-out discussion groups and a final discussion session followed by a Saturday night social event in a nearby Toronto establishment.
The previous two editions of ESAIR, organised by Hugo Zaragoza and Omar Alonso, provided a basis of fruitful discussion as a point of departure:
The first two ESAIR workshops were exploratory workshops to discuss the research space around the topic. A selection of results from them are published in the journal Information Processing and Management (IPM), Volume 46, issue 4, (July 2010). This third edition of the workshop discussed various aspects of application of semantic annotations in practice.
The proceedings of this workshop are available from the ACM Digital Library and are indexed in the DBLP Computer Science Bibliography.
The call for participation in the workshop included a set of challenge questions to focus and guide the discussion. These questions were addressed in one break-out group each.
In the realm of Information Retrieval, why is Semantic Annotation needed? What has changed? Is it the users, the sources, the genres, the technologies, the applications, the queries? If there are differences, why and how can Semantic Annotation help? And more specifically, how and what is Natural Language Processing (NLP) contributing? This talk will share some practical use cases of where and how NLP-based semantic annotation has demonstrated its utility (or a solid promise of utility) in some information access tasks, by virtue of the depth and richness of annotation possible with today's more powerful NLP technologies. What is showing promise is the ability to understand how to utilize the higher levels of language processing to do Semantic Annotation. This is largely through the introduction of the Pragmatic level of language processing - the functional perspective which provides the extra understanding that comes from the study of language in actual use. Pragmatics is concerned with the aspects of language which require context to be understood. Basically, how situational context is lexicalized and grammaticalized. In Pragmatics, the goal is to recognize the extra meaning that humans read into utterances, which other levels of language processing have not recognized as being encoded in them. Semantic Annotation can then go the next step and amplify current annotations with this additional contextual & intentional knowledge. Examples will be shown of what is being done with NLP today that couldn't be done, or simply wasn't being done in earlier days of IR. In applications of keenest interest today, there is an increased relative emphasis focus on dialogue, interaction, real-time, social, and exploratory search, where understanding the user's intent or plan in their query is key. Applications in exploratory search, eDiscovery, sentiment recognition, collaborative search, along with very, very large scale medical insurance consumer applications will be considered in terms of how and what Semantic Annotations can improve by adding more advanced levels of Natural Language Processing.
We compare the costs of semantic annotation of textual documents to its benefits for information processing tasks. Semantic annotation can improve the performance of retrieval tasks and facilitates an improved search experience through faceted search, focused retrieval, better document summaries, and result grouping. Applications which summarize large collections of text or explain real world phenomena based on textual evidence may receive even more benefit from semantic annotations. Semantic annotation creates surplus value if the annotated data can be used beyond any foreseen application. In particular by third parties linking your data by means of your semantic markup to other data with similar markup. We present a list of properties of the annotated data which optimize this surplus value. They are derived from the principle which states that annotation should facilitate the reuse of data in a mashup without information being lost or distorted. For the Dutch House of Parliament we annotated the parliamentary proceedings based on this principle. Concrete examples from this data collection will illustrate the surplus value enhancing properties.