Data Scientist

Description

We are looking for a Data Scientist that will help us discover the information hidden in vast amounts of data, and help us make smarter decisions to deliver even better products. Your primary focus will be in applying data mining techniques, doing statistical analysis, and building high quality prediction systems integrated with our products.
You will be responsible for researching, innovate and implementing state-of-the-art algorithms using deep learning, reinforcement learning techniques in Natural Language Processing task, Machine Reading Comprehension, Recognizing Textual Entailment, Document Classification, Text Analytics, Sentiment Analysis, recommendation engine, A/B testing and more.
Experience: 3-5 Years

Responsibilities

  • Selecting features, building and optimizing classifiers using machine learning techniques
  • Data mining using state-of-the-art methods
  • Extending company’s data with third party sources of information when needed
  • Enhancing data collection procedures to include information that is relevant for building analytic systems
  • Processing, cleansing, and verifying the integrity of data used for analysis
  • Doing ad-hoc analysis and presenting results in a clear manner
  • research and innovation of state-of-the-art papers in NLP problems
  • Working with Backend Engineers to ship your models to production and publish research in top journals e.g.: NIPS, Arxiv and Nature

Skills and Qualifications

  • Proficiency in Python data science tools.
  • Experience in modern (DL) Deep Learning and Natural Language Processing / Natural Language Understanding (NLP, NLU), including Neural Networks, RNNs, seq2seq+attention models, and real world (ML) machine learning in TensorFlow.
  • Great communication skills
  • Experience with data visualisation tools, such as D3.js, GGplot, etc.
  • Proficiency in using query languages such as SQL, Hive or Pig.
  • Good applied statistics skills, such as distributions, statistical testing, regression, etc.
  • Experience building production-ready NLP systems
  • Familiarity with non-standard machine intelligence models (Reinforcement Learning, Hierarchical Temporal Memory, Capsule Networks) is a plus
  • Familiarity with Distributed systems (Docker, Kubernetes, Kafka, Spark, Redis, AWS S3/EC2/RDS/KMS, MongoDB, or Lucene) is a plus
  • Proficient understanding of code versioning tools such as Git, Mercurial or SVN, continuous integration tool like Jenkins.
  • Bachelor’s degree or higher in a technical field of study

Location: Indore | Openings