Annotation Search Engine

Alexander Savev

Svetlin Slavov

Tsvetan Dimitrov

Agenda

  • Motivation

  • Application architecture

  • Technologies used

  1. Crawl articles without any structure (free text)

  2. Annotate the articles

  3. Store annotated articles in an index

  4. Search based on structural and contextual information from the annotation process

    1. Search results show text with colored (highlighted) annotations
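The annotate, index, search and highlight steps above can be sketched in a few lines of Python. This is only a toy illustration under stated assumptions, not the project's implementation: a hard-coded gazetteer (`GAZETTEER`) stands in for the annotation service, an in-memory dictionary (`AnnotationIndex`) stands in for the Lucene index, square brackets stand in for colored highlighting, and crawling is replaced by literal strings. All class and function names here are hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Annotation:
    start: int  # inclusive character offset into the article text
    end: int    # exclusive character offset
    type: str   # annotation type, e.g. "Person" or "Location"


@dataclass
class AnnotatedArticle:
    doc_id: str
    text: str
    annotations: list


# Hypothetical gazetteer standing in for a real annotation service.
GAZETTEER = {"Sofia": "Location", "Bulgaria": "Location", "Ivan Vazov": "Person"}


def annotate(doc_id, text):
    """Step 2: find every gazetteer phrase in the free text."""
    found = []
    for phrase, ann_type in GAZETTEER.items():
        pos = text.find(phrase)
        while pos != -1:
            found.append(Annotation(pos, pos + len(phrase), ann_type))
            pos = text.find(phrase, pos + 1)
    found.sort(key=lambda a: a.start)
    return AnnotatedArticle(doc_id, text, found)


class AnnotationIndex:
    """Step 3: in-memory stand-in for an index over annotated articles."""

    def __init__(self):
        self.articles = {}
        # (annotation type, lowercased surface text) -> set of document ids
        self.postings = defaultdict(set)

    def add(self, article):
        self.articles[article.doc_id] = article
        for a in article.annotations:
            surface = article.text[a.start:a.end].lower()
            self.postings[(a.type, surface)].add(article.doc_id)

    def search(self, ann_type, surface):
        """Step 4: match on annotation type and text, not on raw text alone."""
        ids = sorted(self.postings.get((ann_type, surface.lower()), set()))
        return [self.articles[i] for i in ids]


def highlight(article):
    """Step 4.1: mark each annotation (assumes annotations do not overlap)."""
    out, cursor = [], 0
    for a in article.annotations:
        out.append(article.text[cursor:a.start])
        out.append(f"[{a.type}: {article.text[a.start:a.end]}]")
        cursor = a.end
    out.append(article.text[cursor:])
    return "".join(out)


# Usage: index two "crawled" articles, then search by annotation.
index = AnnotationIndex()
index.add(annotate("a1", "Ivan Vazov was born in Sopot, Bulgaria."))
index.add(annotate("a2", "Sofia is the capital of Bulgaria."))
for hit in index.search("Location", "sofia"):
    print(hit.doc_id, highlight(hit))
```

Keying the postings on the pair (annotation type, text) is what separates this from plain keyword search: a query for the Location "Sofia" would not match an article where "Sofia" was annotated as a Person.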

Motivation

Application architecture

Technologies Used

  • Crawler - Python

  • Annotator - Ontotext S4 (text analytics cloud service)

  • Indexer - Java, Lucene

  • Web Application Frontend - Angular 2

  • Web Application Backend - Java, Spring Boot