mark: A photo of Mark kneeling on top of the Taal Volcano in the Philippines. It was a long hike. (Default)
Mark Smith ([staff profile] mark) wrote in [site community profile] dw_maintenance2013-02-17 12:45 pm

Search changes

Hi all,

I just rolled out some search changes. The two main things you should notice: search indexing is much quicker now (it updates with new content every 15 minutes) and also now comments (on paid accounts) are indexed.

Thank you for all of your patience with search. We outgrew our old trusty search machine and it's now retired. The new one is much larger, has faster disks, and should hold us in good standing for a while as we grow.

I'll be around all day and watching this post. Please let me know if you see any problems!
victor_sudakov: (Default)

[personal profile] victor_sudakov 2013-02-18 02:27 pm (UTC)(link)
It still does not search Cyrillic text. What a shame.
alierak: (Default)

[personal profile] alierak 2013-02-18 05:20 pm (UTC)(link)
Looks like Sphinx can do this but we've left Cyrillic chars out of our charset_table. I guess we would need to add something like "U+410..U+42F->U+430..U+44F, U+430..U+44F" and then reindex everything? [staff profile] mark?
victor_sudakov: (Default)

[personal profile] victor_sudakov 2013-02-19 03:16 am (UTC)(link)
Sphinx most certainly can do Cyrillic searches, at least sphinxsearch-devel-2.0.1b_1 on FreeBSD does that for me.
victor_sudakov: (Default)

[personal profile] victor_sudakov 2013-02-19 04:22 am (UTC)(link)
It seems to work now with one limitation: it searches only for exact matches, not word forms. I also tried searching with wildcards, it does not work either.

Even in this way, it would be a great improvement.
Edited 2013-02-19 04:23 (UTC)
victor_sudakov: (Default)

[personal profile] victor_sudakov 2013-02-19 06:21 am (UTC)(link)
There is no difference, sorry to say. It still finds only exact matches for Cyrillic text.
victor_sudakov: (Default)

[personal profile] victor_sudakov 2013-02-19 06:40 am (UTC)(link)
OK, take the initial form (Nominative case, singular) of the word общение (meaning communication) and try to search for its different forms like общения общению общением общении

This word is present here: http://victor-sudakov.dreamwidth.org/162749.html in the prepositional case.
victor_sudakov: (Default)

[personal profile] victor_sudakov 2013-02-19 05:06 pm (UTC)(link)
Hurrah, it works! Thank you Mark!