architecture challenges of search

37
Architecture challenges of search. Torsten Bøgh Köster (Shopping24) University of Hamburg, November, 14th 2011

Upload: torsten-koester

Post on 19-Jun-2015

275 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Architecture challenges of search

Architecture challenges of search.

Torsten Bøgh Köster (Shopping24) University of Hamburg, November, 14th 2011

Page 2: Architecture challenges of search

Inhalt‣ Ich & mein Brötchengeber‣ Von Totalausfällen und roten Kleidern‣ Von „Pimping“ und „Tweaking“‣ Authentifizierung und Caching‣ Von Flüssen und Klötzen

Page 3: Architecture challenges of search

@tboeghk

‣ Software- and systems- architect‣ Studied at University of Hamburg (class of 2005)‣ Thesis on geo-spatial search‣ 4y experience with Solr / Lucene‣ 5y software architect at Deutscher Ring insurance developing customer and

sales internet / intranet portals‣ 2y system administrator & PHP/Perl hacker

‣ Currently team of 8 Java developers at Shopping24

Page 4: Architecture challenges of search

shopping24 internet group

Page 5: Architecture challenges of search
Page 6: Architecture challenges of search

google legt die messlatte ...

Page 7: Architecture challenges of search

... sehr hoch

Page 8: Architecture challenges of search

suche hat viele facetten

Page 9: Architecture challenges of search

Suche bei shopping24

Page 10: Architecture challenges of search

apache solr

Page 11: Architecture challenges of search

aktuelle sucharchitektur

Page 12: Architecture challenges of search
Page 13: Architecture challenges of search

Eine neue API!

Page 14: Architecture challenges of search

Rahmenbedingungen‣ Jeder Aspekt ein Deployable‣Caching‣ Authentifizierung‣ Vollständige fachliche Kapselung‣ Suchmaschine austauschbar‣ 1 Request <> 1 Portalseite‣ 100ms

Page 15: Architecture challenges of search

Inhalt‣ Ich & mein Brötchengeber‣ Von Totalausfällen und roten Kleidern‣ Von „Pimping“ und „Tweaking“‣ Authentifizierung und Caching‣ Von Flüssen und Klötzen

Page 16: Architecture challenges of search

von totalausfällen und roten kleidern

Page 17: Architecture challenges of search
Page 18: Architecture challenges of search
Page 19: Architecture challenges of search

Precision & Recall

Page 20: Architecture challenges of search

Suchqualität manuell ermitteln

Page 21: Architecture challenges of search

Entwicklungserfolg wird messbar

Page 22: Architecture challenges of search

Zugriffszeitenminimieren

Page 23: Architecture challenges of search

highly effective: data sharding ...

Page 24: Architecture challenges of search

let “sharding“ follow your data ...

Page 25: Architecture challenges of search

... and build separate cores for every client.

Page 26: Architecture challenges of search

Duplicate data as long as access is fast.

andybahn / photocase.com

Page 27: Architecture challenges of search

Authentifizierung vs. Caching ...

Page 28: Architecture challenges of search

... gelöst durch transparente Infrastrukturkomponenten

Page 29: Architecture challenges of search

Datenfluss zur Suchmaschine

Page 30: Architecture challenges of search

Daten werden von Klotz zu Klotz geworfen

Page 31: Architecture challenges of search

Extraktion eines denormalisierten Sortiments

Page 32: Architecture challenges of search

Wozu brauche ich jetzt nochmal einen Search Consultant?

Page 33: Architecture challenges of search

Dedizierte Suchcores - dedizierte ETL-Prozesse

Page 34: Architecture challenges of search

Alle Daten maximal vorverarbeiten

Page 35: Architecture challenges of search

das auge entwickelt mit: umfassende doku muss nicht hässlich sein.

Page 36: Architecture challenges of search

zweifel, ein treuer begleiter: transparenz von anfang an schaffen

Page 37: Architecture challenges of search

Questions? Comments?

‣ Twitter: @tboeghk‣ Github: @tboeghk‣ Email: [email protected]

‣ Web: s24.com‣ Developer (old): developer.smatch.com‣ Developer (new): developer.s24.com

Images: sxc.hu (unless noted otherwise)