Yahoo search bug

None of the search engines can handle Indian languages very well. Google removes the zero width joiners, non joiners , that are used in many languages. Yahoo doesnot remove it. But a UI bug in webpage makes the results wrong..
See the below image:

The bottom half of the image is the source code. We can clearly see that the closing bold tag is placed in between the word instead of putting at the end of the word. As a result, the word is rendered wrong in the page.
This happens for all languages which use ZWJ, ZWNJ, ZWS etc. It breaks the word just before the zwnj/zwj and puts the end of bold tag to highlight the search result..

I showed this to Gopal and told me that he filed a bug on that.

4 thoughts on “Yahoo search bug”

    1. Re: Normalization

      Totally askew conversation, but I tend to see more such web experience ugliness crop up along with random side discussions. Isn’t there a way to collect all this aside from blogging ?


      1. Re: Normalization

        Sure. Just after you find a place where you could collect all your beauty tips – aside from commenting on his blog.

        btw…Peter Norvig in this nice talk, brings in some insight as to how segmentation of unicode text works while crawlers parse them.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.