Install Ubuntu onto VMware Player with OCRopus

1. Download the latest VMware Player (free).
2. Download the latest Ubuntu Server .iso file.
3. Create a new virtual machine in VMware Player.
4. Browse to the Ubuntu .iso and follow the installation instructions.
5. If a web server is needed, install LAMP by following the instructions below.

LAMP (Linux, Apache, MySQL and PHP) is an open source web development platform that uses Linux as the operating system, Apache as the web server, MySQL as the relational database management system and PHP as the object-oriented scripting language.

We showed you in our previous post how to install LAMP in Ubuntu 10.04 with one command using tasksel. tasksel is a software installation application that is an integral part of the Debian installer and also works under Ubuntu. It groups packages by task and offers the user an easy way to install all the packages for a task, which gives the same result as using conventional meta-packages. In Maverick (Ubuntu 10.10) tasksel is not installed by default, so we need to install it before performing the LAMP installation.

Open a terminal and install tasksel with:

sudo apt-get install tasksel

Now, to install LAMP, run the tasksel command in the terminal:

sudo tasksel

Then select LAMP Server from the list.

During the installation you will be asked to set the MySQL root password.
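
The two steps above can also be scripted. The following is a minimal sketch, not the only way to do it: the helper names have_cmd and install_lamp are made up for illustration, and it assumes the task is called lamp-server on your release (tasksel --list-tasks shows the names actually available).

```shell
#!/bin/sh
# Return success if the named command is available on PATH.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

# Install tasksel only when it is missing, then install the LAMP task
# without going through the interactive menu.
# Assumption: the task name is "lamp-server" on this Ubuntu release.
install_lamp() {
    if ! have_cmd tasksel; then
        sudo apt-get install -y tasksel || return 1
    fi
    sudo tasksel install lamp-server
}

# Usage (uncomment to run):
# install_lamp
```

You will still be prompted for the MySQL root password during the installation.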

Now check that PHP is working:

$ sudo vi /var/www/info.php

and add a small PHP test page (typically a single call to phpinfo()).

Save and exit.

Restart apache2:

$ sudo /etc/init.d/apache2 restart

Now open a browser and go to:

http://ip/info.php or http://localhost/info.php

If the page loads, PHP is installed and working.
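
If you prefer not to edit the file in vi, the test page can be written from the shell. This is only a sketch under two assumptions: that the page body is a single phpinfo() call (the usual choice for this kind of check), and that the document root is /var/www as used above. The function name info_page_body is made up for illustration.

```shell
#!/bin/sh
# Print the body of a one-line PHP test page.
# Assumption: a phpinfo() call is what the test page should contain.
info_page_body() {
    printf '%s\n' '<?php phpinfo(); ?>'
}

# Usage (uncomment to run; tee is used so the write happens as root):
# info_page_body | sudo tee /var/www/info.php >/dev/null
```

Remove info.php once you have finished testing, since the page exposes details of your server configuration.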

To fully manage your LAMP server's databases, install phpMyAdmin:

sudo apt-get install phpmyadmin

To log in to phpMyAdmin, open a browser and go to:

http://ip/phpmyadmin or http://localhost/phpmyadmin

6. To install OCRopus, go to and follow the download and install instructions. The OpenFst link provided there is no longer available; instead get it from

If you encounter an "arrayobject.h: No such file or directory" error during make on ocroswig, install numpy using sudo apt-get install python-numpy.

How to use ocroscript:
$ ocroscript recognize /path/to/file.png > /path/to/output.html
$ ocroscript recognize --tessLanguage=eng --output-mode=text ScanPagesPSLulu.jpg
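
To process several scans in one go, the first command above can be wrapped in a loop. This is a rough sketch: html_name and recognize_all are hypothetical helper names, and it assumes you want each HTML file written next to its image with the same base name.

```shell
#!/bin/sh
# Print the output path for an input image: same name, .html extension.
html_name() {
    printf '%s\n' "${1%.*}.html"
}

# Run ocroscript on every argument, writing one HTML file per image.
recognize_all() {
    for img in "$@"; do
        [ -f "$img" ] || continue   # skip missing files and unmatched globs
        ocroscript recognize "$img" > "$(html_name "$img")"
    done
}

# Usage (uncomment to run):
# recognize_all /path/to/scans/*.png
```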

7. Ubuntu sometimes has problems reaching the internet when running in VMware. If that happens, try disabling IPv6.
Using sysctl you can disable IPv6 on the running system without rebooting:

sudo sysctl -w net.ipv6.conf.all.disable_ipv6=1

To disable it permanently, add "net.ipv6.conf.all.disable_ipv6=1" to /etc/sysctl.conf.

Then run sudo sysctl -p to apply the change.
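
If you run this setup more than once, it helps to add the line only when it is not already there. A small sketch of that idea follows; ensure_line is a made-up helper name, and it assumes the setting, if present, appears in the file exactly as written above.

```shell
#!/bin/sh
# Append a line to a config file unless that exact line is already present.
ensure_line() {
    line="$1"
    file="$2"
    grep -qxF -- "$line" "$file" 2>/dev/null || printf '%s\n' "$line" >> "$file"
}

# Usage (as root; uncomment to run):
# ensure_line 'net.ipv6.conf.all.disable_ipv6=1' /etc/sysctl.conf && sysctl -p
```

You can confirm the result with cat /proc/sys/net/ipv6/conf/all/disable_ipv6, which should print 1 once IPv6 is disabled.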


