കേരളത്തിന്റെ സാക്ഷരതാപ്രസ്ഥാനത്തിന്റെ തുടർച്ചയായി സാർവത്രിക ഡിജിറ്റൽ സാക്ഷരതാ പദ്ധതിയ്ക്കായി ചില ശ്രമങ്ങൾ 2019 സെപ്റ്റംബർ മുതൽ ഞാൻ നടത്തിയിരുന്നു. സർക്കാറിന്റെയും മറ്റുസ്ഥാപനങ്ങളുടെയും ആഭിമുഖ്യത്തിൽ ആ പദ്ധതി മാർച്ചിൽ തുടങ്ങാനിരിക്കെ കോവിഡ്-19 മഹാമാരി ഗുരുതരമാകുകയും പദ്ധതിയുടെ ഭാവി അനിശ്ചിതത്തിലാവുകയും ചെയ്തു. അതേ സമയം ഇത്തരമൊരു പദ്ധതിയുടെ ആവശ്യകത ഇരട്ടിക്കുകയും ചെയ്തു. പദ്ധതിയെപ്പറ്റി സർക്കാറിന്റെ ഔദ്യോഗികപ്രഖ്യാപനങ്ങൾ വന്നോട്ടെ എന്നുകരുതി ഞാൻ ഒന്നും പബ്ലിക്ക് ആയി എഴുതിയിരുന്നില്ല. പുതിയ സാഹചര്യത്തിൽ ഈ പദ്ധതിയ്ക്കുവേണ്ടിയുള്ള ഞാനടക്കമുള്ള ഒരുപാടുപേരുടെ പ്രയത്നത്തെ രേഖപ്പെടുത്താനാണ് ഈ ലേഖനം.
തുടക്കം സ്വതന്ത്ര മലയാളം കമ്പ്യൂട്ടിങ്ങിന്റെ പ്രവർത്തനഫലങ്ങൾ കൂടുതൽ ആളുകളിലേക്കെത്തുകയും അതോടൊപ്പം കമ്പ്യൂട്ടർ, മൊബൈൽ, ഇന്റർനെറ്റ് തുടങ്ങിയവ എല്ലാവരും വ്യാപകമായി ഉപയോഗിക്കാൻ തുടങ്ങുകയും ചെയ്തതോടെ കഴിഞ്ഞ കുറേ വർഷങ്ങളായി എന്റെയും സുഹൃത്തുക്കളുടെയും നല്ലൊരുഭാഗം സമയം ഇതുസംബന്ധിച്ച സംശയനിവാരണത്തിനും സഹായങ്ങൾക്കുമായി മാറ്റിവെക്കേണ്ടിവരുന്നുണ്ട്.
[Read More]
Life in the time of lockdown
It has been more than a month since lockdown is announced in Kerala. The world has changed and never going to be same. I thought of leaving a write up here, mainly for future self to recollect how did I spend this time at home.
Sunday, March 21st was general curfew in India. On the same day, Kavya’s grand father passed away at Thodupuzha. Since travel was not allowed on the day, we could not travel from Thiruvananthapuram to Thodupuzha.
[Read More]
Manjari version 1.800 released
A new version of Manjari Malayalam typeface is available now. Version 1.800 adds tabular number and slashed zero opentype features along with bug fixes.
New version of the font is available at: https://smc.org.in/fonts/#manjari
Tabular numbers The Kerala health department publishes daily COVID-19 reports and they use Manjari(example). Sometimes the numbers in table in Manjari font is slightly difficult to read when you want to compare the numbers in columns across rows.
[Read More]
Chilanka font version 1.500 released
A new version of Chilanka Malayalam typeface is available now. Version 1.500 adds glyphs required to support International Alphabet of Sanskrit Transliteration
New version of the font is available at: https://smc.org.in/fonts/#chilanka
Chilanka was the first typeface I designed in 2014 for Malayalam. The usecase I had in mind while designing it was to use it in comic or non-serious contexts. But once it was released, contrary to my expectation, people used for serious Malayalam writing.
[Read More]
Procrustes Analysis Based Handwriting Recognition
Many months back, I started an experiment to see if Malayalam handwriting recognition can be done in a non-machine learning based approach. This blog post explains the approach, the work done so far and results.
Handwriting recognition can be done while the user is writing(called online handwriting recognition) and recognizing a sample somebody wrote in the past(offline recognition). Online and offline recognition problems are different problems. This is because, in online recognition, it is possible to capture additional details such as pen up, pen down, pen movement directions and rotations.
[Read More]
Markov chain for Malayalam
I have been trying to generate a Markov chain for Malayalam content. A Markov chain is a stochastic model describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event.(wikipedia). For natural language, it represents a probabilistic model of words- the probability that one word can come after another word. This model can be prepared by feeding large amount of text to system that learns the probabilities of each words.
[Read More]
Updated web interface for mlmorph
The web interface of Malayalam morphology analyser(mlmorph) is updated. You can see new interface at https://morph.smc.org.in/. The new web application is written in vuejs using vuetify UI framework. The backend is flask. Source code is available at https://gitlab.com/smc/mlmorph-web
Chilanka version 1.400 released
A new version of Chilanka typeface is available now. Version 1.400 is available for download from SMC’s font download and preview site smc.org.in/fonts
For users, there is not much changes, but the source and code build system got a major upgrade.
Source code updated to UFO format from fontforge sfd format. This allows to work with modern font editors. Use cubic beziers for master design, generate OTF along with TTF. The original drawings for Chilanka was using cubic beziers.
[Read More]
Lexicon Curation for Mlmorph
One of the key components of Mlmorph is its lexicon. The lexicon contains the root words categorized as nouns, verbs, adjectives, adverbs etc. These are the components used with morphological rules to generate the vocabulary of Malayalam. I collected initial lexicon with about 100,000 words from various sources such as Wikipedia, CLDR and many targeted web crawls. One problem with such collected words is they often contains spelling mistakes. Secondly, classifying these words is not possible without the tedious task of a person going through each and every words.
[Read More]
LibreOffice Malayalam spellchecker using mlmorph
A few months back, I wrote about the spellchecker based on Malayalam morphology analyser. I was also trying to intergrate that spellchecker with LibreOffice. It is not yet ready for any serious usage, but if you are curious and would like to help me in its further development, please read on.
Malayalam spellchecker – a morphology analyser based approach Blog post on spellchecker approach and pla Current status The libreoffice spellchecker for Malayalam is available at https://gitlab.
[Read More]