Seasons Of Code

Language Detection    • Tejpal kumawat   

WnCC - Seasons of Code

Seasons of Code is a programme launched by WnCC along the lines of the Google Summer of Code. It provides one with an opprtunity to learn and participate in a variety of interesting projects under the mentorship of the very best in our institute.


List of Running Projects

Language Detection

Language Detection


Natural Language Processing which will predict language (English , hindi , chinese and French)

No. of mentees: 5

Description:

  • Project will be based on Natural Language Processing which will predict language (English , hindi , chinese and French)
  • Predict the language of the written text and also language of the text from the images.
  • Read text from images by Optical Character Recognition (OCR) using tensorflow.
  • Use concepts of natural language processing and deep learning to do this
  • Will deploy the model on the heroku with the help of streamlit.

Resources:

  • Streamlit - https://www.youtube.com/playlist?list=PLtqF5YXg7GLmCvTswG32NqQypOuYkPRUE

  • NLP- https://www.youtube.com/watch?v=fM4qTMfCoak&list=PLZoTAELRMXVMdJ5sqbCK2LiM0HhQVWNzm

  • OCR- https://www.youtube.com/watch?v=aELZtpOClWk&list=PLreVlKwe2Z0QKobecSxrheGzrgd4iXJje

  • Github- https://github.com/tejpal123456789/Natural-Language-Processing/blob/main/language_detection.ipynb

Tentative Project Timeline

Week Number Tasks to be Completed
Week 1 Learn basics of python and try to understand basics of deep learning
Week 2 Start learning about natural language processing like how to clean data, how to convert text into vector (basic level) and also about OCR
Week 3 Learn how to train model with simple machine learning algorithm
Week 4 Learn basics of the streamlit
Week 5 Apply all these things on the capstone.

Checkpoints:

Checkpoint Number Progress
1 Python
2 Natural language Processing
3 OCR and its application
4 Model training using python and basics of the stremlit
5 Final Project