FAST Enterprise Search Platform
Version 5.2
Konzepte
Advanced Linguistics Guide
PDF-Datei "SP_Advanced_Linguistics_Guide.pdf"
.../share/Doku/Handbuecher/FAST/FAST-ESP-5.2-Doku/ESP_Advanced_Linguistics_Guide.pdf
Kapitel:
- 1: Linguistics in FAST ESP
- 2: Language and Encoding Detection
- 3: Tokenization and Normalization
- 4: Lemmatization
- 5: Spellchecking Framework
- 6: Anti-Phrasing and Stop Words
- 7: Synonyms
- 8: Entity Extraction
- 9: Noun Phrase Extraction
- 10: Vectorizer
- 11: Structural Analysis (STAN)
- 12: Phonetic Search
- 13: Offensive Content Filter
- 14: Languages requiring special tokenization (CJK+)
- 15: Dictionaries
- 16: Dictionary Maintenance with Dictman