Technical Management (OCR) Based On Machine Learning And Software Development Work From Home Internship at IIT Bombay

Technical Management (OCR) Based On Machine Learning And Software Development

IIT BombayLocation(s): Work from home

Start DateImmediately

Duration6 Months

StipendRs. ₹ 1,000-3,000 /month

Posted On22 Feb '26

Apply By31 May' 21

DESCRIPTION

About the work from home job/internship

Selected intern's day-to-day responsibilities include:

1. Understanding the requirements in detail and eliciting them from the involved stakeholders
2. Leading the machine learning team and guiding them in the challenges that they are facing
3. Collaborating with the backend development team and ensuring the requirements are implemented correctly and the project is headed on the right track
4. Organizing meetings with all the required stakeholders and discussing the requirements, issues, challenges, updates, and progress relevant to the project
5. Overseeing that the development and testing are done as per industry standards and a proper pipeline is followed to ensure smooth operations of the project
6. Deploying the system on private servers and then handling the server operations
7. Managing the team and overseeing the tasks done by them
8. Recruiting skilled team members as per the needs of the project
9. Coordinating with industry experts and taking technical inputs from them in the challenges faced during the project
10. Maintaining the product backlog and software documentation

Note: This is a technical management role where you would be required to lead a backend team working on C++/Python and also to lead the ML team. Technical guidance will be provided by industry experts or professors but the candidate needs to execute the work with the team and manage it efficiently.

Skill(s) required

C++ Programming Django Machine Learning Python Software Testing

Earn certifications in these skills

Learn C++ Programming

Learn Machine Learning

Learn Python

Learn Software Testing

Who can apply

Only those candidates can apply who:

1. are available for the work from home job/internship

2. can start the work from home job/internship between 2nd May'21 and 6th Jun'21

3. are available for duration of 6 months

4. have relevant skills and interests

Perks

Certificate Letter of recommendation Flexible work hours 5 days a week

Additional information

We are working on detecting the layout of the OCRed text, preserving it, and reflecting it to automate the annotation task. Optical character recognition (OCR) is the process of converting document images into an editable electronic format. OCR in Indian languages is quite challenging due to richness in inflections. Using open-source and commercial OCR systems, we have observed the word error rates (WER) of around 20-50% on printed documents in four different Indic languages. Moreover, developing a highly accurate OCR system with an accuracy as high as 90% is not useful unless aided by the mechanism to identify errors. So, we started with the problem of developing "OpenOCRCorrect", an end-to-end framework for error detection and corrections in Indic-OCR. Our models outperform state-of-the-art results in 'Error Detection in Indic-OCR' for six Indic languages with varied inflections and we have solved the out of vocabulary problem for “Error Correction in Indic-OCR” in our ICDAR-2017 conference paper. We further improve the results with the help of sub-word embeddings in our ICDAR-2019 conference paper.

There is an immediate demand to keep the softcopy of the Indian preserved texts. Currently, we are targeting Sanskrit. Although the OCR tools available online do a decent job on English texts, they are not optimized for Indic languages. Thus, we developed an in-house OCR model for the same. The model can detect text with the maximum level of accuracy and can draw bounding boxes on each line of the text. Further, in the digitization process of such texts, the second step would be spelling correction and formatting of the text detected by the OCR models. The demo video for our framework is available here: https://www.youtube.com/watch?v=u9bqUDrGugc. To install the software, you can go to https://github.com/rohitsaluja22/OpenOCRCorrect and follow the instructions given in https://www.youtube.com/watch?v=0hcdlF-zn8E.

Our team 'CLAM' secured 2nd position in the multilingual PostOCR competition at ICDAR'19. Our model achieved the highest corrections of 44% in Finnish, which is significantly higher than the overall topper (8% in Finnish). Our final report for the same can be checked here: https://drive.google.com/file/d/1uuBWu1LQ1QZ49SCgLBoB1er4HpWSzmcx/view. The source code for our framework is available here: https://github.com/rohitsaluja22/OpenOCRCorrect

You can read our paper here:

1. ICDAR 2019: https://www.cse.iitb.ac.in/~rohitsaluja/PID6011473.pdf
2. ICDAR 2017: https://ieeexplore.ieee.org/document/8269944
3. ICDAR-OST 2017: https://ieeexplore.ieee.org/abstract/document/8270254

Number of openings

About IIT Bombay

Website

The Indian Institute of Technology, Bombay (IITB) is one of the fifteen higher institutes of technology in the country, set up intending to make facilities available for higher education, research, and training in various fields of science and technology. Professor Ganesh Ramakrishnan (department of CSE) and professor Ramasubramanian (department of humanities and social sciences) are attempting to significantly speed up the process of digitization of Sanskrit texts. Enabled by the OCR and post-editing related technologies developed at IIT Bombay, they are now seeking the participation of the community of Sanskrit lovers, software developers, machine learning enthusiasts, project managers, etc.

Activity on Internshala

Hiring since December 2013

415 opportunities posted

110 candidates hired

Candidate sign up

By continuing as a candidate, you agree to our T&C.