Language Detection


  • Machine Learning


Mentors :

  • Tejpal kumawat

Mentees :

  • 5 students


Description: Project will be based on Natural Language Processing which will predict language (English , hindi , chinese and French) Predict the language of the written text and also language of the text from the images. Read text from images by Optical Character Recognition (OCR) using tensorflow. Use concepts of natural language processing and deep learning to do this Will deploy the model on the heroku with the help of streamlit.

Resources:

Streamlit - https://www.youtube.com/playlist?list=PLtqF5YXg7GLmCvTswG32NqQypOuYkPRUE

NLP- https://www.youtube.com/watch?v=fM4qTMfCoak&list=PLZoTAELRMXVMdJ5sqbCK2LiM0HhQVWNzm

OCR- https://www.youtube.com/watch?v=aELZtpOClWk&list=PLreVlKwe2Z0QKobecSxrheGzrgd4iXJje

Github- https://github.com/tejpal123456789/Natural-Language-Processing/blob/main/language_detection.ipynb


Tentative Timeline :

Week Number Tasks to be Completed
Week 1 Learn basics of python and try to understand basics of deep learning
Week 2 Start learning about natural language processing like how to clean data, how to convert text into vector (basic level) and also about OCR
Week 3 Learn how to train model with simple machine learning algorithm
Week 4 Learn basics of the streamlit
Week 5 Apply all these things on the capstone.

Checkpoints :

Checkpoint Number Progress
1 Python
2 Natural language Processing
3 OCR and its application
4 Model training using python and basics of the stremlit
5 Final Project