Skip to content

Yahoo search bug

None of the search engines can handle Indian languages very well. Google removes the zero width joiners, non joiners , that are used in many languages. Yahoo doesnot remove it. But a UI bug in webpage makes the results wrong..
See the below image:

The bottom half of the image is the source code. We can clearly see that the closing bold tag is placed in between the word instead of putting at the end of the word. As a result, the word is rendered wrong in the page.
This happens for all languages which use ZWJ, ZWNJ, ZWS etc. It breaks the word just before the zwnj/zwj and puts the end of bold tag to highlight the search result..

I showed this to Gopal and told me that he filed a bug on that.

{ 4 } Comments

  1. unmadindu | December 5, 2008 at 3:45 PM | Permalink


    There may be some issues related to Normalization as well – I’ll blog about that some day.

  2. Anonymous | December 6, 2008 at 6:43 AM | Permalink

    Re: Normalization

    Totally askew conversation, but I tend to see more such web experience ugliness crop up along with random side discussions. Isn’t there a way to collect all this aside from blogging ?


  3. Anonymous | December 14, 2008 at 8:02 AM | Permalink

    Re: Normalization

    Sure. Just after you find a place where you could collect all your beauty tips – aside from commenting on his blog.

    btw…Peter Norvig in this nice talk, brings in some insight as to how segmentation of unicode text works while crawlers parse them.

  4. Anonymous | December 14, 2008 at 8:03 AM | Permalink

    Re: Normalization

    ah, the link …

Post a Comment

Your email is never published nor shared. Required fields are marked *

Powered by WP Hashcash