Anyone know of any good texts on indexing with a view to fast, fuzzy searching over a pretty huge database? All I can turn up are texts on the mathematics of fuzzy logic, which, to be honest, is quite a bit too low level.
Something that springs to mind for me is agrep. I'm also thinking of Lucene (e.g., http://lucene.apache.org/java/docs/queryparsersyntax.html), which for its fuzzy searching uses the Levenshtein distance to decide whether a match is "good enough".
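In case it helps to see what "Levenshtein distance" means in practice, here is a minimal Python sketch of the idea. It is not Lucene's code: the function names, the normalisation by the longer string's length, and the 0.5 threshold are all assumptions of mine for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn a into b."""
    # Keep b as the shorter string so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the first i-1 chars of a and first j chars of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute (or keep if equal)
            ))
        previous = current
    return previous[-1]


def is_fuzzy_match(term: str, candidate: str, min_similarity: float = 0.5) -> bool:
    """Hypothetical 'good enough' test: scale the distance into a 0..1
    similarity score and compare it with a threshold (assumed, not Lucene's)."""
    longest = max(len(term), len(candidate))
    if longest == 0:
        return True  # two empty strings are identical
    similarity = 1.0 - levenshtein(term, candidate) / longest
    return similarity >= min_similarity


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
    print(is_fuzzy_match("roam", "foam"))
```

Note this compares one pair of strings at a time, so on its own it does nothing for speed over a huge database; the index is what narrows down the candidate terms before a distance check like this is applied.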
Ah, cool, cheers weedpacket. I feel a bit of a fool actually (not the first time) because we're using Lucene here in the office. The thing is, we're not doing fuzzy searches, so I'd just assumed it didn't support them. I'll have a look into it.