Free Text Search

Quick Tour

Gartoo HOWTO

  • Queries are analysed using Nestoria/Lokku::NLP
  • Handles '2 bedrooms' and strips out stopwords
  • Checked against LME (Link Management Engine (?))
  • SERP is constructed by Lokku::Gartoo::Search::ListingSearcher
  • Scoring/sorting is done by passing Sphinx algorithm

Gartoo

Matches are highlighted

Gartoo

Only results matching all terms are returned

Nestoria

Returns matches from full_description/keywords

Not obvious why listings are matched

Nestoria

Lots of filtering options

Bilatu

Not so much

Nestoria 'free text search'

Requirements ?

Requirements

  • Allow Free Text Searches
  • Provide cross links to 'similar' searches
  • Run alongside current SIM/geo searches

BIG Open Questions

Limit to geolocatable areas?

  • Will simplify architecture
  • If we don't, what do we do with the map

BIG Open Questions

Concurrent old/new searches

  • How do we allow the user to do geo searches AND free text
  • Two search boxes?
  • Do we (frontend) know at search time which is occuring

Text

BIG Open Questions

Localdata/Metadata/Facets

  • This is very specific SIM code
  • Replicating will take time

BIG Open Questions

Deduplication/Scoring

  • SIM specific functionality
  • Backend databases do not support deduplication as we know it
  • What does Gartoo do?
  • If we keep our current architecture, this will mean running 2 simultaneous SI builds
  • Syncing out twice as much data on each build

BIG Open Questions

Hardware

  • We will run alongside our current site
  • Gartoo currently uses 4 servers for a low throughput site (?)
  • How many extra servers are we expecting?

Small Open Questions

URL formats

  • What formats are we expecting

 

/london-flat-rent-balcony

/houses-in-london-with-gardens-and-a-swimming-pool

/I-totes-loves-Nestoria-init

 

Small Open Questions

How do we search?

  • Match some or all terms?
  • Handle '2 bedroom' type queries?
  • Match against what? (full_description/keywords/address)
  • Aim to highlight results in SERP?

 

Architecture

Build new 'search-api'

frontend-static

frontend

search-api

free-text

Utilise SIM 

frontend-static

frontend

search-api

free-text

sim

Try using sim for facets/metadata

Free Text Search

By chris_lokku

Free Text Search

  • 364