
On-Site
Full-Time
Noida, Uttar Pradesh
India
About the Role
About the Company
We are looking for an experienced Python Developer with expertise in using TensorFlow/PyTorch, LangChain, OpenAI API, Elasticsearch and a deep understanding of Natural Language Processing to help us develop and optimize high-performance applications.
About the Role
You will be responsible for implementing, testing, and maintaining data pipelines, machine learning models, and NLP techniques to extract valuable insights from data.
Responsibilities
Design, develop, and maintain Python-based data analysis and machine learning applications with clean and well-documented code
Develop, optimize and deploy ML models for information retrieval, LLM-based agents, embeddings (FAISS, Pinecone, Weaviate), predictive analytics, and Retrieval-Augmented Generation (RAG)
Research and implement NLP algorithms for text classification, sentiment analysis, named entity recognition (NER), and topic modeling, including troubleshooting and debugging to ensure reliable performance at scale
Implement data pipelines and ETL processes for big data processing
Collaborate with cross-functional teams to understand business requirements and build scalable tech
Qualifications
Strong proficiency in Python with hands-on experience in libraries like Pandas, NumPy, scikit-learn, TensorFlow, PyTorch
Expertise in information retrieval, statistical analysis, data visualization and developing LLM-based agents, embeddings (FAISS, Pinecone, Weaviate), predictive analytics, and Retrieval-Augmented Generation (RAG)
Hands-on experience with Natural Language Processing (NLP) libraries such as NLTK, spaCy, Hugging Face, or similar tools
Experience with data wrangling techniques, including cleaning, transforming, and merging data sets from various sources
Familiarity with machine learning algorithms and frameworks (supervised, unsupervised learning, and deep learning techniques)
Solid understanding of text analytics such as text pre-processing, tokenization, stemming, lemmatization, and part-of-speech tagging
Experience with cloud platforms (AWS, GCP, or Azure) and containerization (Docker, Kubernetes) is a plus.
Knowledge of data visualization tools (Matplotlib, Seaborn, ggplot2, Plotly, etc.)
Strong problem-solving skills and attention to detail with ability to work in an agile, fast-paced environment and deliver results under tight deadlines
Required Skills
4 year Bachelor’s degree in Computer Science, Information Technology, Data Science, Statistics or related domains, or equivalent qualification
4+ years in developing scalable ML models, NLP models and systems from 0 to 1 and deploying them to production
Strong knowledge of RESTful APIs and GraphQL for frontend-backend communication
Familiarity with version control using Git, CI/CD tools, and deployment pipelines
Knowledge of big data tools and platforms (Spark, Hadoop, etc.) and experience with managing databases
We are looking for an experienced Python Developer with expertise in using TensorFlow/PyTorch, LangChain, OpenAI API, Elasticsearch and a deep understanding of Natural Language Processing to help us develop and optimize high-performance applications.
About the Role
You will be responsible for implementing, testing, and maintaining data pipelines, machine learning models, and NLP techniques to extract valuable insights from data.
Responsibilities
Design, develop, and maintain Python-based data analysis and machine learning applications with clean and well-documented code
Develop, optimize and deploy ML models for information retrieval, LLM-based agents, embeddings (FAISS, Pinecone, Weaviate), predictive analytics, and Retrieval-Augmented Generation (RAG)
Research and implement NLP algorithms for text classification, sentiment analysis, named entity recognition (NER), and topic modeling, including troubleshooting and debugging to ensure reliable performance at scale
Implement data pipelines and ETL processes for big data processing
Collaborate with cross-functional teams to understand business requirements and build scalable tech
Qualifications
Strong proficiency in Python with hands-on experience in libraries like Pandas, NumPy, scikit-learn, TensorFlow, PyTorch
Expertise in information retrieval, statistical analysis, data visualization and developing LLM-based agents, embeddings (FAISS, Pinecone, Weaviate), predictive analytics, and Retrieval-Augmented Generation (RAG)
Hands-on experience with Natural Language Processing (NLP) libraries such as NLTK, spaCy, Hugging Face, or similar tools
Experience with data wrangling techniques, including cleaning, transforming, and merging data sets from various sources
Familiarity with machine learning algorithms and frameworks (supervised, unsupervised learning, and deep learning techniques)
Solid understanding of text analytics such as text pre-processing, tokenization, stemming, lemmatization, and part-of-speech tagging
Experience with cloud platforms (AWS, GCP, or Azure) and containerization (Docker, Kubernetes) is a plus.
Knowledge of data visualization tools (Matplotlib, Seaborn, ggplot2, Plotly, etc.)
Strong problem-solving skills and attention to detail with ability to work in an agile, fast-paced environment and deliver results under tight deadlines
Required Skills
4 year Bachelor’s degree in Computer Science, Information Technology, Data Science, Statistics or related domains, or equivalent qualification
4+ years in developing scalable ML models, NLP models and systems from 0 to 1 and deploying them to production
Strong knowledge of RESTful APIs and GraphQL for frontend-backend communication
Familiarity with version control using Git, CI/CD tools, and deployment pipelines
Knowledge of big data tools and platforms (Spark, Hadoop, etc.) and experience with managing databases