Introduction to Apache Solr
Son Le - IBM Technical Evangelist
E: sonle@sg.ibm.com
T: @thsonvt
Agenda
Big Data Landscape
Macro Trends Driving NoSQL Technology
Introduction
Source: Scaling Apache Solr (Safaribookonline.com)
Apache Solr Architecture
Source: Solr In Action (Safaribookonline.com)
Apache Solr Main Benefits
Apache Solr Main Benefits
Apache Solr Key Features
Apache Solr Key Features
Apache Solr Key Features
Overview of Indexing Process
Source: Solr in Action (safaribooksonline.com)
Data Model in Apache Solr
Document
The basic and atomic unit of information in Solr. It is a container of fields and values that belong to a given entity of your domain model (for example, a book, car, or person).
Data Model in Apache Solr
Inverted Index
Data Model in Apache Solr
Field Type
One of the top-level entities declared in Solr schemas. A field type is declared using the <fieldType> element
Data Model in Apache Solr
Tokenizer
Breaks an incoming character stream into one or more tokens depending on specific criteria
Example: "I'm writing a simple text" as the input text
Data Model in Apache Solr
Analyzer
Examines the text of fields and generates a token stream