(keitai-l) Kanji OCR & english translation on 505i series DoCoMo phones using integrated camera

From: Petr Vyskocil <petr_at_anime.cz>
Date: 12/18/03
Message-ID: <006401c3c536$f8bfefa0$57a45da5@puchiko>
Hi all,

I have developped a kanji OCR application for mobiles (and afterwards found
out it was mentioned on this list already...); and since it reached a stage
when it is becoming a bit usable (do not expect wonders...), I've decided to
ask you for a bit of stress-testing ;)

URL: http://kohaku.anime.cz/docoreco.html

please DO NOT pass this URL to non-programmers as the server is 1) not yet
completely stable and 2) not fast enough for public release (just 500MHz
PIII).

Short instructions: start the program, press [SELECT] to take a picture of
japanese text rotated by 90 degrees (i.e. hold the phone by right hand and
take a picture of a text going from the top of the display to the bottom).
Look at the result of postprocessing - if it is black text on white
background that is readable, send it to server using [SELECT], and english
translation provided by excite web service will be (if you're lucky)
returned to you. If the result of preprocessing is rubbish, press
[SOFT_BTN_1] and try again.

The code works on left-to-right text only for now, do not try it in
top-to-bottom. Theoretically it can process also hand-written input, but...
let's say it's not working on my gaijin-style kanjis, but it will work on
the scribbless done by the natives ^_^.

The program should work on all DoJa 3.0 phones (505i series), but was tested
only on F505i - please post your results with other models.

Communication sizes: program size ~12kB, typically below 2kB of transmitted
data per one recognition, with the upper limit being 6kB.

Both server and client are under development, so there may be some outages -
I promise I'll try to minimize them :)

Hoping to see gaijins in supermarkets taking pictures of unknown texts from
food labels :)

Best regards,

    Petr Vyskocil
    Tokyo University of Agriculture & Technology
    Nakagawa Laboratories

Copyright stuff: in time this will be probably released under GPL, but for
now the program is (C) Tokyo University of Agriculture & Technology, except
GIF encoding routines:
// GifEncoder - write out an image as a GIF
//
// DoJa implementation by Petr Vyskocil
//
// Copyright (C)1996,1998 by Jef Poskanzer <jef@acme.com>. All rights
reserved.
//
// Redistribution and use in source and binary forms, with or without
// modification, are permitted provided that the following conditions
// are met:
// 1. Redistributions of source code must retain the above copyright
//    notice, this list of conditions and the following disclaimer.
// 2. Redistributions in binary form must reproduce the above copyright
//    notice, this list of conditions and the following disclaimer in the
//    documentation and/or other materials provided with the distribution.
//
// THIS SOFTWARE IS PROVIDED BY THE AUTHOR AND CONTRIBUTORS ``AS IS'' AND
// ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
// IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
PURPOSE
// ARE DISCLAIMED.  IN NO EVENT SHALL THE AUTHOR OR CONTRIBUTORS BE LIABLE
// FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR
CONSEQUENTIAL
// DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
// OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
// HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT,
STRICT
// LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
// OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
// SUCH DAMAGE.
Received on Thu Dec 18 09:20:18 2003