TL;DR Since version 23.0.0 , Manticore can make searches like xt850 match xt 850 using bigram_delimiter together with digit-aware bigram_index modes. This solves a common tokenization mismatch in product search, where users remove spaces from model names but the source data stores them as separate tokens. Assumptions and verification This article assumes: RT tables created with SQL examples exactly as shown default tokenization unless the example explicitly changes a setting ASCII digits in mode
Comment
Sign in to join the discussion.
Loading comments…