Back to jobs

Data Engineer II

ABOUT US

We are Ai Palette. We think the world would be a nicer place to be if Food Companies could create products that the consumers really want. So we are making it happen. We want to be the most preferred Food AI company in the world. We’re making it possible by building an AI-powered SaaS platform based on our founders’ experience in the Food Industry & AI.

We are a Series A round funded company with experienced investors like pi Ventures, Exfinity ventures and Anthill ventures. We are building a global company and are already working with customers across the globe. Our customers include Fortune 500 companies and FMCG brands.

We have won several accolades & awards in our short period of existence such as:

  • Top 10 Global Food & Retail Tech startup at Kickstart Innovation'20 at Switzerland
  • Top 15 Global Food tech Startup at Slingshot'19 at Singapore
  • Top 200 Global startups at HKSTP Epic'20 at HongKong
  • Top 500 Global Food Tech startups by Forward Fooding

 

WHAT’S IT LIKE TO WORK AT AI PALETTE? 

 

We're a growing technology startup headquartered in Singapore with our Engineering base in Bangalore and a team in US, We PaletteeRs are a highly passionate and motivated bunch of people that help each other do remarkable things and achieve extraordinary results every single day. We are active learners, have a positive impact on consumers’ lives and settle for nothing short of excellence. We face challenges together and we win together. In our vision to build the World's First AI Platform for New Product Innovation, there isn’t a day that goes by where we don’t have “Aha” moments.  We strive together to deliver world-class solutions that transform the way consumer products of today and tomorrow will be created. Join us!

About the job

We are looking for a passionate Data Engineer who will be working in the Data Engineering division and core development of our AI Platform that will ideate consumer products of the future. As the Data Engineer, you will be working hand in hand with the Data Science and Full Stack team on the toughest and most challenging problems in Data Engineering and Cloud Computing handling millions of data points that include social media. You will get the opportunity to work and scale a growing Data Platform.

You don’t want a job that just pays the bills: you want a job to get out of bed for. You take more enjoyment from solving problems than your friends think is normal. If you are looking to find your “ikigai”, then your search stops here right with us!

RESPONSIBILITIES

  1. Build the data collection pipelines that can acquire and handle millions of public data points from various sources using APIs and web extraction techniques at scale
  2. Build the data cleaning, quality and integrity pipeline in the platform leveraging the Apache Spark Python and AWS services
  3. Architect and develop distributed systems that can handle large-scale data processing
  4. Programming Language: Python/Java, Apache Spark, Apache Flink (Good to have)
  5. NoSQL Database: Elasticsearch, Dynamo DB/Mongo DB
  6. Work closely with the Data Science Team for the preprocessing steps required for the AI models in Production Environment
  7. Scale and automate Data Platform Collection layer to handle the consistently incoming data
  8. Implement data quality checks and validation processes
  9. Troubleshoot and resolve performance issues
  10. Collaborate with cross-functional teams to support data-related initiatives
  11. Document data pipelines, data models, and other technical processes

REQUIREMENTS

  1. 4-6 years of experience in building data engineering pipeline
  2. Have hand ons experience on Apache Spark, PySpark, SQL, Python/Java programming
  3. Worked with NoSQL database like Elasticsearch, Dynamo DB/Mongo DB, Cassandra - Anyone
  4. Strong experience in AWS Cloud Platform - S3, EC2, Lambda, Elasticsearch etc
  5. Quick learner, excellent communication and team player
IDEAL
  • Deep technical understanding of AWS PaaS and IaaS services
  • Previous experience with Airflow, Spark, PySpark, Python, ElasticSearch, Web Crawling and Docker
  • Prior experience with social media data (twitter, reddit, blogs, etc.)

BENEFITS

  • Great progression opportunities - we want you to grow with us.
  • Look after yourself with health insurance including Hospital/Surgical.
  • Learn new skills with sponsored training on MOOCs such as Coursera, Udemy.  

EQUAL OPPORTUNITY: 

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Apply for this job

*

indicates a required field

Resume/CV

Accepted file types: pdf, doc, docx, txt, rtf

Cover Letter

Accepted file types: pdf, doc, docx, txt, rtf


Select...