<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Santhosh Thottingal &#187; Misc</title>
	<atom:link href="http://thottingal.in/blog/category/uncategorized/feed/" rel="self" type="application/rss+xml" />
	<link>http://thottingal.in/blog</link>
	<description>/home/santhosh</description>
	<lastBuildDate>Mon, 14 Nov 2011 06:06:25 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Malayalam Wikisource Offline version</title>
		<link>http://thottingal.in/blog/2011/06/11/malayalam-wikisource-offline-version/</link>
		<comments>http://thottingal.in/blog/2011/06/11/malayalam-wikisource-offline-version/#comments</comments>
		<pubDate>Sat, 11 Jun 2011 09:11:38 +0000</pubDate>
		<dc:creator>Santhosh</dc:creator>
				<category><![CDATA[Community]]></category>
		<category><![CDATA[Indic]]></category>
		<category><![CDATA[Malayalam]]></category>
		<category><![CDATA[Misc]]></category>
		<category><![CDATA[Projects]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://thottingal.in/blog/?p=358</guid>
		<description><![CDATA[Malayalam Wikisource community today released the first offline version of Malayalam wikisource during the 4th annual wiki meetup of Malayalam wikimedians. To the  best of our knowledge, this is the first time a wikisource project release its offline version. Malayalam wiki community had released the first version of Malayalam wikipedia one year back. Releasing the [...]]]></description>
			<content:encoded><![CDATA[<p>Malayalam Wikisource community today released the first offline version of <a href="http://ml.wikisource.org" target="_blank">Malayalam wikisource</a> during the 4th annual wiki meetup of Malayalam wikimedians. To the  best of our knowledge, this is the first time a wikisource project release its offline version. Malayalam wiki community <a href="http://thottingal.in/blog/2010/04/17/mlwikioncd/" target="_blank">had released</a> the first version of Malayalam wikipedia one year back.</p>
<p>Releasing the offline version of a wikisource is a challenging project. The technical aspects of the project was designed and implemented by myself. So let me share the details of the project.</p>
<p>As you know a Wikisource contains lot of books, and each book varies in its size, it is divided to chapters or sections. There is no common pattern for books. Each having its own structure. A novel presentation is different from a collection of poems from a Poet. Wikisource also has religious books like Bible, Quran, Bhagavat Geeta, Ramayana etc.  Since books are for continuous reading for a long time, the readabilty and how we present the lengthy chapters in screen also matters. Offline wikipedia tools for example, <a href="http://www.kiwix.org/" target="_blank">Kiwix</a> does not do any layout modification of the content and present as it is shown in wikipedia/wikisource. <a href="https://github.com/santhoshtr/wiki2cd" target="_blank">The tool</a> we wrote last year for Malayalam wikipedia offline version also present scrollable vertical content in the screen. Both are not configurable to give different presentation styles depending on the nature of the book.</p>
<p>What we wanted is a book reader kind of application interface.  Readers should be able to easily navigate to books, chapters. The chapter content will be very lengthy. For a long time reading of this content,  a lengthy vertically scrolled text is not a good idea. We also need to take care of the width of the lines.  If each line spans 80-90% of the screen, especially for a wide screen monitor, it is a strain for neck and eyes.</p>
<p>&nbsp;</p>
<div id="attachment_361" class="wp-caption aligncenter" style="width: 405px"><a href="http://thottingal.in/blog/wp-content/uploads/2011/06/2011-06-09-19-29-211.png"><img class="size-large wp-image-361" title="2011-06-09-19-29-21" src="http://thottingal.in/blog/wp-content/uploads/2011/06/2011-06-09-19-29-211-1024x455.png" alt="" width="395" height="175" /></a><p class="wp-caption-text">Screenshot of Offline version. Click to enlarge</p></div>
<p style="text-align: center;"><a href="http://thottingal.in/blog/wp-content/uploads/2011/06/2011-06-09-19-29-21.png"><br />
</a></p>
<p>The selection of books for the offline version was done by the active wikimedians at Wiksource. Some of the selected books was proof read by many volunteers within the last  2 weeks.</p>
<p>The tools used for extracting htmls were adhoc and adapted to meet the good presentation of each book. So there is nothing much to reuse here. Extracting the html and then taking the content part alone using pyquery and removing some unwanted sections from html- basically this is what our scripts did. The content is added to predefined HTML templates with proper CSS for the UI. CSS3 multicolumn feature was used for book like interface. Since IE did not implement this standard even in IE9, for that browser the book like interface was not provided. Chrome browser with version less than 12 could not support, because of these bugs: <a href="http://code.google.com/p/chromium/issues/detail?id=45840">http://code.google.com/p/chromium/issues/detail?id</a><a href="http://code.google.com/p/chromium/issues/detail?id=45840">=45840</a> and <a href="http://code.google.com/p/chromium/issues/detail?id=78155">http://code.google.com/p/chromium/issues/detail?id</a><a href="http://code.google.com/p/chromium/issues/detail?id=78155">=78155</a>. For easy navigation, mouse wheel support and page navigation buttons provided. For solving non-availability of required fonts, webfonts were integrated with a selection box  to select favorite font. Reader can also select the font size to make the reading comfortable.</p>
<p>Why static html? The variety of platforms and other versions we need to support, necessity to have webfonts, complex script rendering, effort to develop and customize UI, relatively small size of the data, avoiding any installation of software in users system etc made us to choose static html+ jquery + css as the technology choice. The downside is we could not provide full text search.</p>
<p>Apart from the wikisource, we also included a collection of copyleft of images from wikimedia commons. Thanks to <a href="http://nishan-naseer.blogspot.com/" target="_blank">Nishan Naseer</a>, for preparing a gallery application using jquery. We selected 4 categories from Commons which are related to Kerala. We hope everybody will like the pictures and it will give  a small introduction to Wikimedia Commons.<br />
<a href="http://thottingal.in/blog/wp-content/uploads/2011/06/2011-06-11-09-22-06.png"><img class="aligncenter size-large wp-image-364" title="2011-06-11 09-22-06" src="http://thottingal.in/blog/wp-content/uploads/2011/06/2011-06-11-09-22-06-1024x474.png" alt="" width="453" height="209" /></a><br />
Even though the python scripts are not ready to reuse in any projects, if anybody want to have a look at it, please mail me. I am not putting it in public since the script does not make sense outside the context of each book and its existing presentation in Malayalam wikisource.</p>
<p>The CD image is available for download <a href="http://www.mlwiki.in/cdimage/mlwikisource.iso" target="_blank">here</a> and one can also browse the CD content <a href="http://www.mlwiki.in/wikisrccd" target="_blank">here</a>.</p>
<p>Thanks to Shiju Alex for coordinating this project. And thanks to all Malayalam wikisource volunteers for making this happen.  We have included poems, folk songs, devotional songs, novel, grammar book, tales, books on Hinduism, Islam-ism, Christianity, Communism, Philosophy. With this release, it becomes the biggest offline digital archive of Malayalam books.</p>
]]></content:encoded>
			<wfw:commentRss>http://thottingal.in/blog/2011/06/11/malayalam-wikisource-offline-version/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>On Machine Translation and God</title>
		<link>http://thottingal.in/blog/2009/06/27/on-machine-translation-and-god/</link>
		<comments>http://thottingal.in/blog/2009/06/27/on-machine-translation-and-god/#comments</comments>
		<pubDate>Sat, 27 Jun 2009 04:58:31 +0000</pubDate>
		<dc:creator>Santhosh</dc:creator>
				<category><![CDATA[Misc]]></category>

		<guid isPermaLink="false">http://thottingal.in/blog/?p=174</guid>
		<description><![CDATA[I was reading an article named &#8220;Why Can&#8217;t a Computer Translate More Like a Person?&#8221; by Alan K. Melby. The article is about the challenges that machine translation technology face to reach a acceptable quality of translation. He explains the importance of culture sensitivity required for machine translation programs. Article lists a number of examples [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;">I was reading an article named <a href="http://www.ttt.org/theory/barker.html">&#8220;Why Can&#8217;t a Computer Translate More Like a Person?&#8221;</a> by Alan K. Melby. The article is about  the challenges that machine translation technology face to reach a acceptable quality of translation. He explains the importance of culture sensitivity required for machine translation programs. Article lists a  number of examples where MT can go wrong if context , culture etc are not taken into consideration.  There are very interesting arguments about how <a href="http://en.wikipedia.org/wiki/Reductionism">reductionalism</a> becomes a wrong choice while designing MT. If you are interested in  natural language processing or machine translation and wondering if there is any limit for computer programs to reach human&#8217;s language capabilities, please read it.</p>
<p style="text-align: justify;">The article is written long time back, and Machine Translation technologies improved a lot. There are commercial as well as free translation products for many languages. There are research going on in intra-indic as well as english-indic translations.  I am not sure how far these technologies solved the challenges mentioned in the above mentioned article, but I believe that the questions are still valid.</p>
<p>The question is whether the programs can understand our culture, language usage , emotions etc. For translating limited domain or dry content, the machine translation may be effective, but in a general purpose use, I don&#8217;t know <a href="http://en.wikipedia.org/wiki/Machine_translation#Major_issues">how effective they are. </a></p>
<p>Melby argues :</p>
<blockquote>
<p style="text-align: justify;">That key factor which is missing from current theories is agency. By agency, I mean the capacity to make real choices by exercising our will, ethical choices for which we are responsible. [...]. Any &#8216;choice&#8217; that is a rigid and unavoidable consequence of the circumstances is not a real choice that could have gone either way and is thus not an example of agency. A computer has no real choice in what it will do next. Its next action is an unavoidable consequence of the machine language it is executing and the values of data presented to it. I am proposing that any approach to meaning that discounts agency will amount to no more than the mechanical manipulation of symbols such as words, that is, moving words around and linking them together in various ways instead of understanding them. Computers can already manipulate symbols. In fact, that is what they mostly do. But manipulating symbols does not give them agency and it will not let them handle language like humans. Symbol manipulation works only within a specific domain, and any attempt to move beyond a domain through symbol manipulation is doomed, for manipulation of symbols involves no true surprises, only the strict application of rules. General vocabulary, as we have seen, involves true surprises that could not have been predicted.</p>
</blockquote>
<p style="text-align: justify;">With all these advanced technologies, can we develop a universal , any-to-any language translation program? We have seen many examples where human beings are <a href="http://www.ojohaven.com/fun/translation.funnies.html">failing</a> <a href="http://www.flickr.com/groups/chinglish/pool/">miserably</a> in sensible translation. If you are looking for  english-&gt;hindi translation effectiveness, <a href="http://translate.google.com/translate_t?text=%E0%A4%86%E0%A4%AA%20%E0%A4%B9%E0%A4%BF%E0%A4%A8%E0%A5%8D%E0%A4%A6%E0%A5%80%20%E0%A4%B8%E0%A4%AE%E0%A4%9D%E0%A4%A4%E0%A5%87%20%E0%A4%B9%E0%A5%88%20?#hi|en|%E0%A4%86%E0%A4%AA%20%E0%A4%B9%E0%A4%BF%E0%A4%A8%E0%A5%8D%E0%A4%A6%E0%A5%80%20%E0%A4%B8%E0%A4%AE%E0%A4%9D%E0%A4%A4%E0%A5%87%20%E0%A4%B9%E0%A5%88%20%3F">try this using google Translation</a></p>
<blockquote><p>आप <span style="text-decoration: underline;">हिन्दी</span> समझते है ? ==&gt; You understand <span style="text-decoration: underline;">English</span>?</p></blockquote>
<p style="text-align: justify;">So do you think that if there is any such <a href="http://en.wikipedia.org/wiki/Universal_translator">universal translation tool</a>,  it is nearly impossible and &#8220;only god can create such a tool&#8221; ?! . Heard about <a href="http://en.wikipedia.org/wiki/Races_and_species_in_The_Hitchhiker%27s_Guide_to_the_Galaxy#Babel_fish">Babel fish</a> (of The Hitchhiker&#8217;s Guide to the Galaxy)? .  The babel fish<strong><strong> </strong></strong> is small, yellow, leech-like, and is a <a title="Universal translator" href="http://en.wikipedia.org/wiki/Universal_translator">universal translator</a> which simultaneously translates from one spoken language to another. When inserted into the ear, its nutrition processes convert sound waves into brain waves, neatly crossing the language divide between any species you should happen to meet whilst travelling in space. According to the <a href="http://en.wikipedia.org/wiki/The_Hitchhiker%27s_Guide_to_the_Galaxy"><em>Hitchhiker&#8217;s Guide</em></a>, the Babel fish was put forth as an example for the <em>non</em>-existence of  God: .</p>
<blockquote><p><em>&#8220;I refuse to prove that I exist,&#8221; says God, &#8220;for  proof denies  faith, and without faith I am nothing.&#8221;</em><br />
<em></em></p>
<p><em>&#8220;But,&#8221; says Man, &#8220;the Babel fish is a dead giveaway isn&#8217;t it? It could not have  evolved by chance. It proves that you exist, and so therefore, by your own arguments, you don&#8217;t.  Q.E.D. </em></p>
<p><em></em><em>&#8220;Oh dear,&#8221; says God, &#8220;I hadn&#8217;t thought of that,&#8221; and promptly vanishes in a puff of  logic</em></p></blockquote>
<p>Alan K Melby argues that Douglas Adams was saying that there can&#8217;t be any such fish.</p>
<blockquote>
<p style="text-align: justify;">The silliness of the above argument is intended, I believe, to show the futility of trying to prove the existence of God, through physics or any other route. Belief in God is a starting point, not a conclusion. If it were a conclusion, then that conclusion would have to be based on something else that is firmer than our belief in God. If that something else forces everyone to believe in God, then faith is denied. If that something else does not force us to believe in God, then it may not be a sufficiently solid foundation for our belief.</p>
<p style="text-align: justify;">Adams may also be saying something about translation and the nature of language. I can speculate on what Adams had in mind to say about translation when he dreamed up the Babel fish. My own bias would have him saying indirectly that there could be no such fish since there is no universal set of thought patterns underlying all languages. Even with direct brain to brain communication, we would still need shared concepts in order to communicate. Words do not really fail us. If two people share a concept, they can eventually agree on a word to express it. Ineffable experiences are those that are not shared by others.</p>
</blockquote>
<p style="text-align: justify;">I have some friends studying on machine translation with Indian Languages. They are evaluating shallow transfer method(Statistical methods to the words surrounding the ambiguous word.) for this using tools like <a href="http://www.apertium.org/">apertium</a>. Let us hope that they will succeed in their efforts.</p>
<p>Let me give one example translation between Tamil and Malayalam where context matters.</p>
<p style="text-align: justify;">In Malayalam, for &#8216;wait, wait&#8217;, we usually say, &#8220;നില്ക്കു് നില്ക്ക്&#8221;(Literal meaning:  &#8216;stand, stand&#8217; ) . For the same purpose , I have noticed that my Tamil speaking friends  use &#8220;இரு இரு&#8221; (Literal meaning: &#8216;sit, sit&#8217;  ). Now if the translation is done without knowing this usage, it is going to be funny. Shallow transfer methods use multiple intermediate  languages for translation. For eg: If there is a translation  tool available for a-&gt;b and b-&gt;c and then a-&gt;c is possible through a-&gt;b-&gt;c . I feel that this is going to be a big challenge.. to keep the word meaning, context, common usage&#8230;etc.. Let us <strong>wait/sit/stand</strong> and see <img src='http://thottingal.in/blog/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> </p>
<p>Since we saw &#8220;a nonexistence of God proof&#8221;, let me give another one, that I read sometime back.</p>
<ol>
<li>God is so powerful, he can do any thing,</li>
<li>God can create anything , if #1 is true</li>
<li>If #2 is is true, he can create a big stone that he cannot lift!</li>
<li>If he cannot lift a stone, then #1 is wrong, hence #2 also wrong. So God does not exist!</li>
</ol>
<p>Looks very silly, right? or &#8220;Logical&#8221; ? <img src='http://thottingal.in/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://thottingal.in/blog/2009/06/27/on-machine-translation-and-god/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>PDFBox : Extract Text from PDF</title>
		<link>http://thottingal.in/blog/2009/06/24/pdfbox-extract-text-from-pdf/</link>
		<comments>http://thottingal.in/blog/2009/06/24/pdfbox-extract-text-from-pdf/#comments</comments>
		<pubDate>Wed, 24 Jun 2009 04:40:34 +0000</pubDate>
		<dc:creator>Santhosh</dc:creator>
				<category><![CDATA[Misc]]></category>
		<category><![CDATA[java]]></category>
		<category><![CDATA[pdf]]></category>

		<guid isPermaLink="false">http://thottingal.in/blog/?p=168</guid>
		<description><![CDATA[Recently I had to extract text from PDF files for indexing the content using Apache Lucene. Apache PDFBox was the obvious choice for the java library to be used. Apache PDFBox is an opensource java library for working with PDF files. The PDFBox library allows creation of new PDF documents, manipulation of existing documents and [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;">Recently I had to extract text from PDF files for indexing the content using Apache Lucene. Apache PDFBox was the obvious choice for the java library to be used.</p>
<p style="text-align: justify;"><a href="http://incubator.apache.org/pdfbox/">Apache PDFBox</a> is an opensource java library for working with PDF files. The PDFBox library allows creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents. PDFBox also includes several command line utilities.</p>
<p style="text-align: justify;">There is <a href="http://incubator.apache.org/pdfbox/download.html">no latest build available</a> for PDFBox. Sourceforge has very <a href="http://sourceforge.net/project/showfiles.php?group_id=78314">old binaries</a>. But  the old version<a href="https://issues.apache.org/jira/browse/PDFBOX-361"> fails to work with PDF 1.5 specification</a>. So one need to compile the latest code from SVN. </p>
<p style="text-align: justify;">I am sharing the latest jar file built from svn <a href="http://thottingal.in/download/pdfbox/">here</a></p>
<p style="text-align: justify;">The following example explains how to extract the text from a pdf file using PDFBox.</p>

<div class="wp_codebox_msgheader"><span class="right"><sup><a href="http://www.ericbess.com/ericblog/2008/03/03/wp-codebox/#examples" target="_blank" title="WP-CodeBox HowTo?"><span style="color: #99cc00">?</span></a></sup></span><span class="left"><a href="javascript:;" onclick="javascript:showCodeTxt('p168code2'); return false;">View Code</a> JAVA</span><div class="codebox_clear"></div></div><div class="wp_codebox"><table><tr id="p1682"><td class="line_numbers"><pre>1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
</pre></td><td class="code" id="p168code2"><pre class="java" style="font-family:monospace;"><span style="color: #000000; font-weight: bold;">import</span> <span style="color: #006699;">java.io.File</span><span style="color: #339933;">;</span>
<span style="color: #000000; font-weight: bold;">import</span> <span style="color: #006699;">java.io.FileInputStream</span><span style="color: #339933;">;</span>
<span style="color: #000000; font-weight: bold;">import</span> <span style="color: #006699;">java.io.IOException</span><span style="color: #339933;">;</span>
<span style="color: #000000; font-weight: bold;">import</span> <span style="color: #006699;">org.apache.pdfbox.cos.COSDocument</span><span style="color: #339933;">;</span>
<span style="color: #000000; font-weight: bold;">import</span> <span style="color: #006699;">org.apache.pdfbox.pdfparser.PDFParser</span><span style="color: #339933;">;</span>
<span style="color: #000000; font-weight: bold;">import</span> <span style="color: #006699;">org.apache.pdfbox.pdmodel.PDDocument</span><span style="color: #339933;">;</span>
<span style="color: #000000; font-weight: bold;">import</span> <span style="color: #006699;">org.apache.pdfbox.util.PDFTextStripper</span><span style="color: #339933;">;</span>
&nbsp;
<span style="color: #000000; font-weight: bold;">public</span> <span style="color: #000000; font-weight: bold;">class</span> PDFTextParser <span style="color: #009900;">&#123;</span>
&nbsp;
	<span style="color: #666666; font-style: italic;">// Extract text from PDF Document</span>
	<span style="color: #000000; font-weight: bold;">static</span> <a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Astring+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">String</span></a> pdftoText<span style="color: #009900;">&#40;</span><a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Astring+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">String</span></a> fileName<span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
		PDFParser parser<span style="color: #339933;">;</span>
		<a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Astring+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">String</span></a> parsedText <span style="color: #339933;">=</span> <span style="color: #000066; font-weight: bold;">null</span><span style="color: #339933;">;;</span>
		PDFTextStripper pdfStripper <span style="color: #339933;">=</span> <span style="color: #000066; font-weight: bold;">null</span><span style="color: #339933;">;</span>
		PDDocument pdDoc <span style="color: #339933;">=</span> <span style="color: #000066; font-weight: bold;">null</span><span style="color: #339933;">;</span>
		COSDocument cosDoc <span style="color: #339933;">=</span> <span style="color: #000066; font-weight: bold;">null</span><span style="color: #339933;">;</span>
		<a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Afile+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">File</span></a> file <span style="color: #339933;">=</span> <span style="color: #000000; font-weight: bold;">new</span> <a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Afile+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">File</span></a><span style="color: #009900;">&#40;</span>fileName<span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
		<span style="color: #000000; font-weight: bold;">if</span> <span style="color: #009900;">&#40;</span><span style="color: #339933;">!</span>file.<span style="color: #006633;">isFile</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
			<a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Asystem+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">System</span></a>.<span style="color: #006633;">err</span>.<span style="color: #006633;">println</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;File &quot;</span> <span style="color: #339933;">+</span> fileName <span style="color: #339933;">+</span> <span style="color: #0000ff;">&quot; does not exist.&quot;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			<span style="color: #000000; font-weight: bold;">return</span> <span style="color: #000066; font-weight: bold;">null</span><span style="color: #339933;">;</span>
		<span style="color: #009900;">&#125;</span>
		<span style="color: #000000; font-weight: bold;">try</span> <span style="color: #009900;">&#123;</span>
			parser <span style="color: #339933;">=</span> <span style="color: #000000; font-weight: bold;">new</span> PDFParser<span style="color: #009900;">&#40;</span><span style="color: #000000; font-weight: bold;">new</span> <a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Afileinputstream+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">FileInputStream</span></a><span style="color: #009900;">&#40;</span>file<span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
		<span style="color: #009900;">&#125;</span> <span style="color: #000000; font-weight: bold;">catch</span> <span style="color: #009900;">&#40;</span><a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Aioexception+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">IOException</span></a> e<span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
			<a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Asystem+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">System</span></a>.<span style="color: #006633;">err</span>.<span style="color: #006633;">println</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;Unable to open PDF Parser. &quot;</span> <span style="color: #339933;">+</span> e.<span style="color: #006633;">getMessage</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			<span style="color: #000000; font-weight: bold;">return</span> <span style="color: #000066; font-weight: bold;">null</span><span style="color: #339933;">;</span>
		<span style="color: #009900;">&#125;</span>
		<span style="color: #000000; font-weight: bold;">try</span> <span style="color: #009900;">&#123;</span>
			parser.<span style="color: #006633;">parse</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			cosDoc <span style="color: #339933;">=</span> parser.<span style="color: #006633;">getDocument</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			pdfStripper <span style="color: #339933;">=</span> <span style="color: #000000; font-weight: bold;">new</span> PDFTextStripper<span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			pdDoc <span style="color: #339933;">=</span> <span style="color: #000000; font-weight: bold;">new</span> PDDocument<span style="color: #009900;">&#40;</span>cosDoc<span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			pdfStripper.<span style="color: #006633;">setStartPage</span><span style="color: #009900;">&#40;</span><span style="color: #cc66cc;">1</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			pdfStripper.<span style="color: #006633;">setEndPage</span><span style="color: #009900;">&#40;</span><span style="color: #cc66cc;">5</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			parsedText <span style="color: #339933;">=</span> pdfStripper.<span style="color: #006633;">getText</span><span style="color: #009900;">&#40;</span>pdDoc<span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
		<span style="color: #009900;">&#125;</span> <span style="color: #000000; font-weight: bold;">catch</span> <span style="color: #009900;">&#40;</span><a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Aexception+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">Exception</span></a> e<span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
			<a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Asystem+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">System</span></a>.<span style="color: #006633;">err</span>
					.<span style="color: #006633;">println</span><span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;An exception occured in parsing the PDF Document.&quot;</span>
							<span style="color: #339933;">+</span> e.<span style="color: #006633;">getMessage</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
		<span style="color: #009900;">&#125;</span> <span style="color: #000000; font-weight: bold;">finally</span> <span style="color: #009900;">&#123;</span>
			<span style="color: #000000; font-weight: bold;">try</span> <span style="color: #009900;">&#123;</span>
				<span style="color: #000000; font-weight: bold;">if</span> <span style="color: #009900;">&#40;</span>cosDoc <span style="color: #339933;">!=</span> <span style="color: #000066; font-weight: bold;">null</span><span style="color: #009900;">&#41;</span>
					cosDoc.<span style="color: #006633;">close</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
				<span style="color: #000000; font-weight: bold;">if</span> <span style="color: #009900;">&#40;</span>pdDoc <span style="color: #339933;">!=</span> <span style="color: #000066; font-weight: bold;">null</span><span style="color: #009900;">&#41;</span>
					pdDoc.<span style="color: #006633;">close</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			<span style="color: #009900;">&#125;</span> <span style="color: #000000; font-weight: bold;">catch</span> <span style="color: #009900;">&#40;</span><a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Aexception+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">Exception</span></a> e<span style="color: #009900;">&#41;</span> <span style="color: #009900;">&#123;</span>
				e.<span style="color: #006633;">printStackTrace</span><span style="color: #009900;">&#40;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
			<span style="color: #009900;">&#125;</span>
		<span style="color: #009900;">&#125;</span>
		<span style="color: #000000; font-weight: bold;">return</span> parsedText<span style="color: #339933;">;</span>
	<span style="color: #009900;">&#125;</span>
	<span style="color: #000000; font-weight: bold;">public</span> <span style="color: #000000; font-weight: bold;">static</span> <span style="color: #000066; font-weight: bold;">void</span> main<span style="color: #009900;">&#40;</span><a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Astring+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">String</span></a> args<span style="color: #009900;">&#91;</span><span style="color: #009900;">&#93;</span><span style="color: #009900;">&#41;</span><span style="color: #009900;">&#123;</span>
		<a href="http://www.google.com/search?hl=en&amp;q=allinurl%3Asystem+java.sun.com&amp;btnI=I%27m%20Feeling%20Lucky"><span style="color: #003399;">System</span></a>.<span style="color: #006633;">out</span>.<span style="color: #006633;">println</span><span style="color: #009900;">&#40;</span>pdftoText<span style="color: #009900;">&#40;</span><span style="color: #0000ff;">&quot;/home/santhosh/pdfbox/test.pdf&quot;</span><span style="color: #009900;">&#41;</span><span style="color: #009900;">&#41;</span><span style="color: #339933;">;</span>
	<span style="color: #009900;">&#125;</span>
&nbsp;
<span style="color: #009900;">&#125;</span>
&nbsp;
 </pre></td></tr></table></div>

<p>More details on the APIs can be read from <a href="http://incubator.apache.org/pdfbox/userguide/index.html">here</a></p>
]]></content:encoded>
			<wfw:commentRss>http://thottingal.in/blog/2009/06/24/pdfbox-extract-text-from-pdf/feed/</wfw:commentRss>
		<slash:comments>31</slash:comments>
		</item>
		<item>
		<title>In solidarity</title>
		<link>http://thottingal.in/blog/2009/05/13/in-solidarity/</link>
		<comments>http://thottingal.in/blog/2009/05/13/in-solidarity/#comments</comments>
		<pubDate>Wed, 13 May 2009 22:34:00 +0000</pubDate>
		<dc:creator>Santhosh</dc:creator>
				<category><![CDATA[Misc]]></category>
		<category><![CDATA[politics]]></category>

		<guid isPermaLink="false">http://thottingal.in/blog/?p=66</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<p><a href="http://binayaksen.net"><img src="http://binayaksen.net/wp-content/gallery/site-graphics/have-a-heart-1.gif"/></a></p>
]]></content:encoded>
			<wfw:commentRss>http://thottingal.in/blog/2009/05/13/in-solidarity/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Youtube to MPEG or Ogg video conversion</title>
		<link>http://thottingal.in/blog/2008/11/22/youtube-to-mpeg-or-ogg-video-conversion/</link>
		<comments>http://thottingal.in/blog/2008/11/22/youtube-to-mpeg-or-ogg-video-conversion/#comments</comments>
		<pubDate>Sun, 23 Nov 2008 05:37:00 +0000</pubDate>
		<dc:creator>Santhosh</dc:creator>
				<category><![CDATA[Misc]]></category>

		<guid isPermaLink="false">http://thottingal.in/blog/?p=56</guid>
		<description><![CDATA[Here is the two line method to convert a youtube video to oggvorbis video. Locate clive and ffmpeg2theora in your package and install $clive http://in.youtube.com/watch?v=6JeZ5oeAEyU (replace this with the youtube address you want) It will create a flv file. Convert to mpeg video file $ffmpeg -i AmericaAmerica.flv AmericaAmerica.mpg Convert to ogg video file $ffmpeg2theora AmericaAmerica.mpg [...]]]></description>
			<content:encoded><![CDATA[<p>Here is the two line method to convert a youtube video to oggvorbis video.<br/><br />
Locate clive and ffmpeg2theora in your package and install<br/><br />
<code>$clive <a href="http://in.youtube.com/watch?v=6JeZ5oeAEyU">http://in.youtube.com/watch?v=6JeZ5oeAEyU </a></code>(replace this with the youtube address you want)<br />
It will create a flv file.<br/><br />
<b>Convert to mpeg video file</b><br/><br />
<code> $ffmpeg -i AmericaAmerica.flv  AmericaAmerica.mpg</code><br/><br />
<b>Convert  to ogg video file</b><br/><br />
<code>$ffmpeg2theora AmericaAmerica.mpg </code>(replace it with the name of the flv file the previous command created)<br/><br />
Done. You can see the .ogg file in the directory from where you executed the above commands<br/></p>
]]></content:encoded>
			<wfw:commentRss>http://thottingal.in/blog/2008/11/22/youtube-to-mpeg-or-ogg-video-conversion/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Apt on the Road</title>
		<link>http://thottingal.in/blog/2008/05/29/apt-on-the-road/</link>
		<comments>http://thottingal.in/blog/2008/05/29/apt-on-the-road/#comments</comments>
		<pubDate>Fri, 30 May 2008 05:51:00 +0000</pubDate>
		<dc:creator>Santhosh</dc:creator>
				<category><![CDATA[Misc]]></category>
		<category><![CDATA[apt]]></category>
		<category><![CDATA[fun]]></category>
		<category><![CDATA[picture]]></category>

		<guid isPermaLink="false">http://thottingal.in/blog/?p=43</guid>
		<description><![CDATA[Could not get a big one, since I was taking this from a bus while overtaking this vehicle.]]></description>
			<content:encoded><![CDATA[<p><a href="http://pics.livejournal.com/santhoshtr/pic/0000hpge/"><img src="http://pics.livejournal.com/santhoshtr/pic/0000hpge/s320x240" width="320" height="240" border='0'/></a></p>
<p>Could not get a big one, since I was taking this from a bus while overtaking this vehicle.</p>
]]></content:encoded>
			<wfw:commentRss>http://thottingal.in/blog/2008/05/29/apt-on-the-road/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ധ്വനി-കെ.ഡി.ഇ സംയോജനം</title>
		<link>http://thottingal.in/blog/2008/03/16/%e0%b4%a7%e0%b5%8d%e0%b4%b5%e0%b4%a8%e0%b4%bf-%e0%b4%95%e0%b5%86-%e0%b4%a1%e0%b4%bf-%e0%b4%87-%e0%b4%b8%e0%b4%82%e0%b4%af%e0%b5%8b%e0%b4%9c%e0%b4%a8%e0%b4%82/</link>
		<comments>http://thottingal.in/blog/2008/03/16/%e0%b4%a7%e0%b5%8d%e0%b4%b5%e0%b4%a8%e0%b4%bf-%e0%b4%95%e0%b5%86-%e0%b4%a1%e0%b4%bf-%e0%b4%87-%e0%b4%b8%e0%b4%82%e0%b4%af%e0%b5%8b%e0%b4%9c%e0%b4%a8%e0%b4%82/#comments</comments>
		<pubDate>Mon, 17 Mar 2008 04:46:00 +0000</pubDate>
		<dc:creator>Santhosh</dc:creator>
				<category><![CDATA[Misc]]></category>

		<guid isPermaLink="false">http://thottingal.in/blog/2008/03/16/%e0%b4%a7%e0%b5%8d%e0%b4%b5%e0%b4%a8%e0%b4%bf-%e0%b4%95%e0%b5%86-%e0%b4%a1%e0%b4%bf-%e0%b4%87-%e0%b4%b8%e0%b4%82%e0%b4%af%e0%b5%8b%e0%b4%9c%e0%b4%a8%e0%b4%82/</guid>
		<description><![CDATA[KDE ഡെസ്ക്ടോപ്പില്‍ ധ്വനി ടെക്സ്റ്റ് ടു സ്പീച്ച് സിസ്റ്റം ചേര്‍ത്തു് kedit, kate, kwrite, konqueror എന്നിവയിലുള്ള മലയാളം(ധ്വനി പിന്തുണയ്ക്കുന്ന മറ്റു ഭാഷകളും) വായിക്കാം. കോണ്‍ക്വറര്‍ വെബ് ബ്രൌസറിലും മലയാളം വെബ് പേജുകള്‍ വായിക്കാന്‍ ധ്വനി ഉപയോഗിക്കാം. ഇതിനായി ഞാന്‍ പ്രത്യേകം കോഡൊന്നും എഴുതിയിട്ടില്ല. . ktts(KDE യുടെ TTS system) കമാന്റ് പ്ലഗിന്‍ എന്ന ഒരു സൌകര്യം ഉപയോഗിച്ചാണു് ഇതു ചെയ്യാന്‍ കഴിയുന്നതു്. Kontrol center ല്‍ പോയി Regional and Accessibility എന്ന വിഭാഗത്തിലെ [...]]]></description>
			<content:encoded><![CDATA[<p>KDE ഡെസ്ക്ടോപ്പില്‍ ധ്വനി ടെക്സ്റ്റ് ടു സ്പീച്ച് സിസ്റ്റം ചേര്‍ത്തു് kedit, kate, kwrite, konqueror എന്നിവയിലുള്ള മലയാളം(ധ്വനി പിന്തുണയ്ക്കുന്ന മറ്റു ഭാഷകളും) വായിക്കാം. കോണ്‍ക്വറര്‍ വെബ് ബ്രൌസറിലും മലയാളം വെബ് പേജുകള്‍ വായിക്കാന്‍ ധ്വനി ഉപയോഗിക്കാം. ഇതിനായി ഞാന്‍ പ്രത്യേകം കോഡൊന്നും എഴുതിയിട്ടില്ല. <img src='http://thottingal.in/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> . ktts(KDE യുടെ TTS system) കമാന്റ് പ്ലഗിന്‍ എന്ന ഒരു സൌകര്യം ഉപയോഗിച്ചാണു് ഇതു ചെയ്യാന്‍ കഴിയുന്നതു്. <br />Kontrol center ല്‍ പോയി Regional and Accessibility എന്ന വിഭാഗത്തിലെ Text-to-speech എടുക്കുക. അവിടെ Talkers tab ല്‍  Add എന്ന ബട്ടണ്‍ ക്ലിക്ക് ചെയ്യുക. Synthesizer എന്നതിന്റെ Show All തിരഞ്ഞെടുത്ത് Command എന്നെടുക്കുക. Language എന്നതു് Other എന്നും. ഇപ്പോള്‍ നിങ്ങള്‍ക്ക് സിന്തസൈസറിന്റെ കമാന്റ് ചേര്‍ക്കാനുള്ള ഒരു ജാലകം കിട്ടും. അവിടെ <br />dhvani %f<br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_yXi4s2T6Sz4/R91CC9SFRaI/AAAAAAAAAE8/ktvgwuoJVqI/s1600-h/dhvani-ktts1.png"><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://4.bp.blogspot.com/_yXi4s2T6Sz4/R91CC9SFRaI/AAAAAAAAAE8/ktvgwuoJVqI/s400/dhvani-ktts1.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5178367765123515810" /></a></p>
<p><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_yXi4s2T6Sz4/R91CiNSFRbI/AAAAAAAAAFE/XSPdrU7Xqp4/s1600-h/dhvani-ktts2.png"><img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;" src="http://1.bp.blogspot.com/_yXi4s2T6Sz4/R91CiNSFRbI/AAAAAAAAAFE/XSPdrU7Xqp4/s400/dhvani-ktts2.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5178368301994427826" /></a></p>
<p>എന്നു ചേര്‍ക്കുക. ഈ Talker നെ ഡിഫോള്‍ട്ട് ആക്കുക. തീര്‍ന്നു. മുമ്പ് പറഞ്ഞ അപ്ലിക്കേനുകളിലെല്ലാം വായിക്കേണ്ട ഭാഗം സെലക്ട് ചെയ്തു് ടൂള്‍സ് മെനുവില്‍ നിന്നു് Speak text എടുക്കുക.  <br />For for information about dhvani, how to install etc see the <a href="http://fci.wikia.com/wiki/Dhvani">documentation</a></p>
]]></content:encoded>
			<wfw:commentRss>http://thottingal.in/blog/2008/03/16/%e0%b4%a7%e0%b5%8d%e0%b4%b5%e0%b4%a8%e0%b4%bf-%e0%b4%95%e0%b5%86-%e0%b4%a1%e0%b4%bf-%e0%b4%87-%e0%b4%b8%e0%b4%82%e0%b4%af%e0%b5%8b%e0%b4%9c%e0%b4%a8%e0%b4%82/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GNOME 2.22 Released</title>
		<link>http://thottingal.in/blog/2008/03/14/gnome-2-22-released/</link>
		<comments>http://thottingal.in/blog/2008/03/14/gnome-2-22-released/#comments</comments>
		<pubDate>Sat, 15 Mar 2008 04:07:00 +0000</pubDate>
		<dc:creator>Santhosh</dc:creator>
				<category><![CDATA[Misc]]></category>
		<category><![CDATA[gnome]]></category>
		<category><![CDATA[SMC]]></category>

		<guid isPermaLink="false">http://thottingal.in/blog/?p=35</guid>
		<description><![CDATA[Gnome released its 2.22 version . The GNOME desktop and platform received many improvements and new features. It has official support for 46 languages. Malayalam, Marathi, Tamil, Gujarati and Punjabi completed more than 80% of translations and present in the supported languages. Read the release notes to know the new features]]></description>
			<content:encoded><![CDATA[<p>Gnome released its  2.22 version . The GNOME desktop and platform received many improvements and new features.<br />
It has official support for 46 languages. Malayalam, Marathi, Tamil, Gujarati and Punjabi completed more than 80% of translations and present in the supported languages.<br />
<a href="http://library.gnome.org/misc/release-notes/2.22/"><img src="http://www.gnome.org/img/flash/two-twenty-two.png"/></a></p>
<p>Read the <a href="http://library.gnome.org/misc/release-notes/2.22/"> release notes </a>to know the new features</p>
]]></content:encoded>
			<wfw:commentRss>http://thottingal.in/blog/2008/03/14/gnome-2-22-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>കൃഷ്ണകാന്ത് മനേ എന്ന അന്ധപ്രോഗ്രാമ്മര്‍</title>
		<link>http://thottingal.in/blog/2008/03/02/kkmane/</link>
		<comments>http://thottingal.in/blog/2008/03/02/kkmane/#comments</comments>
		<pubDate>Sun, 02 Mar 2008 18:06:00 +0000</pubDate>
		<dc:creator>Santhosh</dc:creator>
				<category><![CDATA[Misc]]></category>

		<guid isPermaLink="false">http://thottingal.in/blog/2008/03/02/%e0%b4%95%e0%b5%83%e0%b4%b7%e0%b5%8d%e0%b4%a3%e0%b4%95%e0%b4%be%e0%b4%a8%e0%b5%8d%e0%b4%a4%e0%b5%8d-%e0%b4%ae%e0%b4%a8%e0%b5%87-%e0%b4%8e%e0%b4%a8%e0%b5%8d%e0%b4%a8-%e0%b4%85%e0%b4%a8%e0%b5%8d/</guid>
		<description><![CDATA[അന്ധനായ ഒരാള്‍ക്കു് ഒരു പ്രോഗ്രാമ്മറാവാമോ? കൃഷ്ണകാന്ത് മനേ ഒരു അന്ധ പ്രോഗ്രാമ്മറാണു്. മുംബൈയിലെ ടാറ്റാ ഇന്‍സ്റ്റിട്യൂട്ട് ഓഫ് ഫണ്ടമെന്റല്‍ റിസര്‍ച്ച് സെന്ററിലെ ഗവേഷകനുമാണു്. കൂടാതെ സ്വതന്ത്ര സോഫ്റ്റ്‌വെയര്‍ പ്രചാരകനും, അന്ധരായ വിദ്യാര്‍ത്ഥികളെ പരിശീലിപ്പിക്കുന്ന വിദഗ്ദ്ധനും, പല സംസ്ഥാന സര്‍ക്കാറുകളുടെയും അന്ധര്‍ക്കായുള്ള വിദ്യാഭ്യാസ പദ്ധതികളുടെ ഉപദേശകനുമാണു്.കുറച്ചുമാസങ്ങള്‍ക്കു് മുന്‍പ്, ബാംഗ്ലൂരില്‍ വച്ചാണു് ഞാന്‍ മനേയെ പരിചയപ്പെടുന്നതു്. തന്റെ IBM thinkpad ലാപ്‌ടോപ്പില്‍ ഉബുണ്ടു ഗ്നു/ലിനക്സും ഓര്‍ക്ക(Orca) എന്ന സ്ക്രീന്‍ റീഡര്‍ സോഫ്റ്റ്‌വെയറും ഉപയോഗിച്ചു് അദ്ദേഹം നെറ്റ് ബ്രൗസ് ചെയ്യുന്നതും, മെയില്‍ [...]]]></description>
			<content:encoded><![CDATA[<p>അന്ധനായ ഒരാള്‍ക്കു് ഒരു പ്രോഗ്രാമ്മറാവാമോ? കൃഷ്ണകാന്ത് മനേ ഒരു അന്ധ പ്രോഗ്രാമ്മറാണു്. മുംബൈയിലെ ടാറ്റാ ഇന്‍സ്റ്റിട്യൂട്ട് ഓഫ് ഫണ്ടമെന്റല്‍ റിസര്‍ച്ച് സെന്ററിലെ ഗവേഷകനുമാണു്. കൂടാതെ സ്വതന്ത്ര സോഫ്റ്റ്‌വെയര്‍ പ്രചാരകനും, അന്ധരായ വിദ്യാര്‍ത്ഥികളെ പരിശീലിപ്പിക്കുന്ന വിദഗ്ദ്ധനും, പല സംസ്ഥാന സര്‍ക്കാറുകളുടെയും അന്ധര്‍ക്കായുള്ള വിദ്യാഭ്യാസ പദ്ധതികളുടെ ഉപദേശകനുമാണു്.<br />കുറച്ചുമാസങ്ങള്‍ക്കു് മുന്‍പ്, ബാംഗ്ലൂരില്‍ വച്ചാണു് ഞാന്‍ മനേയെ പരിചയപ്പെടുന്നതു്. തന്റെ IBM thinkpad ലാപ്‌ടോപ്പില്‍ ഉബുണ്ടു ഗ്നു/ലിനക്സും ഓര്‍ക്ക(Orca) എന്ന സ്ക്രീന്‍ റീഡര്‍ സോഫ്റ്റ്‌വെയറും ഉപയോഗിച്ചു് അദ്ദേഹം നെറ്റ് ബ്രൗസ് ചെയ്യുന്നതും, മെയില്‍ നോക്കുന്നതും, പ്രോഗ്രാം ചെയ്യുന്നതും കണ്ടു് ഞാന്‍ അത്ഭുതപ്പെടുപോയി. കാഴ്ചയുള്ള ആരും ചെയ്യുന്ന അതേ ലാളിത്യത്തോടുകൂടിത്തന്നെ അദ്ദേഹം അതെല്ലാം ചെയ്യുന്നു. <a href="http://live.gnome.org/Orca">Orca</a> ഒരു സ്വതന്ത്ര സ്ക്രീന്‍ റീഡര്‍ സോഫ്റ്റ്‌വെയറാണു്. അതു് സ്ക്രീനിലെ വാചകങ്ങളെ ശബ്ദമാക്കിത്തരുന്നു.  ഇംഗ്ലീഷിലാണു് അദ്ദേഹം അതുപയോഗിച്ചിരുന്നതു്. ഫെസ്റ്റിവല്‍ എന്ന ടെക്സ്റ്റ് ടു സ്പീച്ച് സോഫ്റ്റ്‌വെയറിന്റെ സഹായത്തോടെ.<br />അദ്ദേഹത്തെപ്പറ്റിയും തമിഴ്‌നാട് സര്‍ക്കാറിന്റെ ഓര്‍ക്കയും ഉബുണ്ടുവും ഉപയോഗിച്ചുള്ള അന്ധവിദ്യാര്‍ത്ഥികള്‍ക്കായുള്ള പരിപാടിയെക്കുറിച്ചുമുള്ള ഒരു വീഡിയോ താഴെക്കൊടുത്തിരിക്കുന്നു.<br /><object height="355" width="425"><param name="movie" value="http://www.youtube.com/v/lzLIKxpZV0U"><param name="wmode" value="transparent"><embed src="http://www.youtube.com/v/lzLIKxpZV0U" type="application/x-shockwave-flash" wmode="transparent" height="355" width="425"></embed></object></p>
<p>ഇംഗ്ലീഷിനുപകരം മലയാളത്തിലോ തമിഴിലോ ഉള്ള ടെക്സ്റ്റ് ടു സ്പീച്ച് സോഫ്റ്റ്‌വെയറുകള്‍ ഉപയോഗിക്കുകയാണെങ്കില്‍ അതു കൂടുതല്‍ ഉപയോഗപ്രദമായിരിക്കും.ഇതിനുള്ള ഒരു തടസ്സം ഭാരതീയ ഭാഷകള്‍ക്കു് ഉപയോഗിക്കാവുന്ന ടെക്സ്റ്റ് ടു സ്പീച്ച് സോഫ്റ്റ്‌വെയറുകളുടെ അഭാവമാണു്. ഇതിനുള്ള ഒരു പരിഹാരമാണു്, ഞാനും ഇന്ത്യന്‍ ഇന്‍സ്റ്റിട്യൂട്ട് ഓഫ് സയന്‍സ് ബാംഗ്ലൂരിലെ പ്രൊഫസറായ ഡോ: രമേഷ് ഹരിഹരനും കൂടി വികസിപ്പിച്ചു കൊണ്ടിരിക്കുന്ന <a href="http://fci.wikia.com/wiki/Dhvani">ധ്വനി</a> എന്ന സോഫ്റ്റ്‌വെയര്‍. മലയാളമടക്കം 8 ഭാഷകള്‍ ധ്വനിക്കു് സംസാരിയ്ക്കാന്‍ കഴിയും.</p>
<p>ധ്വനി ഇക്കൊല്ലത്തെ <a href="http://www.efytimes.com/efytimes/24867/news.htm">ഫോസ് ഇന്ത്യ അവാര്‍ഡിനു്</a> അര്‍ഹമായ പ്രൊജക്റ്റാണു്. NRCFOSS(National Resouce Center for Free and Open Source Software) സ്പോണ്‍സര്‍ചെയ്യുന്ന 25000 രൂപയാണു് അവാര്‍ഡ് തുക. (<a href="http://smc.org.in/">സ്വതന്ത്രമലയാളം കമ്പ്യൂട്ടിങ്ങിന്റെ</a> വേറൊരു പ്രൊജക്റ്റായ ടക്സ് ടൈപ്പിനും അവാര്‍ഡുണ്ടു്. തൃശൂര്‍ എഞ്ചിനീയറിങ്ങ് കോളേജിലെ മോബിന്‍ , ശ്രീരഞ്ജ് , ശ്രേയസ്, പ്രിന്‍സ്, വിമല്‍ എന്നീ വിദ്യാര്‍ത്ഥികള്‍ക്കാണു് അവാര്‍ഡ് ലഭിച്ചതു്.)<br />ധ്വനി ഉപയോഗിച്ചു് എങ്ങനെ മലയാളം ടെക്സ്റ്റുകളെ mp3/ogg ആക്കി മാറ്റാമെന്നറിയാന്‍ <a href="http://santhoshtr.livejournal.com/8759.html"> ഈ ബ്ലോഗ് പോസ്റ്റ് </a> വായിക്കുക.</p>
]]></content:encoded>
			<wfw:commentRss>http://thottingal.in/blog/2008/03/02/kkmane/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>വേഗനിയന്ത്രണത്തിനായി പുതിയൊരു മാര്‍ഗ്ഗം!</title>
		<link>http://thottingal.in/blog/2008/01/23/%e0%b4%b5%e0%b5%87%e0%b4%97%e0%b4%a8%e0%b4%bf%e0%b4%af%e0%b4%a8%e0%b5%8d%e0%b4%a4%e0%b5%8d%e0%b4%b0%e0%b4%a3%e0%b4%a4%e0%b5%8d%e0%b4%a4%e0%b4%bf%e0%b4%a8%e0%b4%be%e0%b4%af%e0%b4%bf-%e0%b4%aa%e0%b5%81/</link>
		<comments>http://thottingal.in/blog/2008/01/23/%e0%b4%b5%e0%b5%87%e0%b4%97%e0%b4%a8%e0%b4%bf%e0%b4%af%e0%b4%a8%e0%b5%8d%e0%b4%a4%e0%b5%8d%e0%b4%b0%e0%b4%a3%e0%b4%a4%e0%b5%8d%e0%b4%a4%e0%b4%bf%e0%b4%a8%e0%b4%be%e0%b4%af%e0%b4%bf-%e0%b4%aa%e0%b5%81/#comments</comments>
		<pubDate>Wed, 23 Jan 2008 21:22:00 +0000</pubDate>
		<dc:creator>Santhosh</dc:creator>
				<category><![CDATA[Misc]]></category>

		<guid isPermaLink="false">http://thottingal.in/blog/2008/01/23/%e0%b4%b5%e0%b5%87%e0%b4%97%e0%b4%a8%e0%b4%bf%e0%b4%af%e0%b4%a8%e0%b5%8d%e0%b4%a4%e0%b5%8d%e0%b4%b0%e0%b4%a3%e0%b4%a4%e0%b5%8d%e0%b4%a4%e0%b4%bf%e0%b4%a8%e0%b4%be%e0%b4%af%e0%b4%bf-%e0%b4%aa%e0%b5%81/</guid>
		<description><![CDATA[കേരളത്തിലെ റോഡുകളില്‍ നിന്നു് പ്രചോദനമുള്‍ക്കൊണ്ടു് വാഹനങ്ങളുടെ വേഗനിയന്ത്രണത്തിനായി പുതിയൊരു മാര്‍ഗ്ഗം! താഴെക്കൊടുത്തിരിക്കുന്ന ചിത്രങ്ങളൊന്നു നോക്കൂ!!! ചിത്രങ്ങള്‍ക്ക് കടപ്പാട്: http://tides.ws/2008/01/20/new-speed-controlling-device/]]></description>
			<content:encoded><![CDATA[<p>കേരളത്തിലെ റോഡുകളില്‍ നിന്നു് പ്രചോദനമുള്‍ക്കൊണ്ടു് വാഹനങ്ങളുടെ വേഗനിയന്ത്രണത്തിനായി പുതിയൊരു മാര്‍ഗ്ഗം!  താഴെക്കൊടുത്തിരിക്കുന്ന ചിത്രങ്ങളൊന്നു നോക്കൂ!!!</p>
<p><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://tides.ws/wp-content/uploads/2008/01/060.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px;" src="http://tides.ws/wp-content/uploads/2008/01/060.jpg" alt="" border="0" /></a></p>
<p><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://tides.ws/wp-content/uploads/2008/01/060_0011.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px;" src="http://tides.ws/wp-content/uploads/2008/01/060_0011.jpg" alt="" border="0" /></a><br /><a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://tides.ws/wp-content/uploads/2008/01/060_002.jpg"><img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px;" src="http://tides.ws/wp-content/uploads/2008/01/060_002.jpg" alt="" border="0" /></a><br />ചിത്രങ്ങള്‍ക്ക് കടപ്പാട്: http://tides.ws/2008/01/20/new-speed-controlling-device/</p>
]]></content:encoded>
			<wfw:commentRss>http://thottingal.in/blog/2008/01/23/%e0%b4%b5%e0%b5%87%e0%b4%97%e0%b4%a8%e0%b4%bf%e0%b4%af%e0%b4%a8%e0%b5%8d%e0%b4%a4%e0%b5%8d%e0%b4%b0%e0%b4%a3%e0%b4%a4%e0%b5%8d%e0%b4%a4%e0%b4%bf%e0%b4%a8%e0%b4%be%e0%b4%af%e0%b4%bf-%e0%b4%aa%e0%b5%81/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
<!-- WP Super Cache is installed but broken. The path to wp-cache-phase1.php in wp-content/advanced-cache.php must be fixed! -->
