Data Platform Engineer
MongoDB, Buenos Aires
MongoDB unleashes the power of software and data for innovators everywhere
Headquartered in New York, with offices across North America, Europe, and Asia-Pacific, MongoDB has more than 15,000 customers, which include some of the largest and most sophisticated businesses in nearly every vertical industry, in over 100 countries.
MongoDB is growing rapidly and seeking a Senior Data Engineer to be a key contributor to the overall internal data platform at MongoDB. You will build data driven solutions to help drive MongoDB's growth as a product and as a company. You will take on complex data-related problems using very diverse data sets.
MongoDB is growing rapidly and seeking a Data Platform Engineer to be a key contributor to the overall internal data platform at MongoDB. You will design and build a scalable data driven platform to help drive MongoDB’s growth as a product and as a company, while also taking on complex platform problems with the goal of making our platform more scalable, reliable, and robust.
Our ideal candidate has experience with:
- several programming languages (Python, Scala, Java, etc.)
- AWS services such as EMR, Lambda, S3, Athena, Glue, IAM, RDS, etc.
- orchestration tools such as Airflow, Luiji, Azkaban, Cask, etc.
- streaming data processing frameworks like Kafka, KSQ, and Spark Streaming
- a diverse set of databases like MongoDB, Cassandra, Redshift, Postgres, etc.
- different storage formats such as Parquet, Avro, Arrow, and JSON
- data processing frameworks like Spark
- Git and Github
- CI/CD Pipelines
You might be an especially great fit if you:
- Constantly think of ways to squeeze better performance out of a data platform
- Plan effective data storage, security, sharing, and publishing within an organization
- Design boilerplate architecture that can abstract underlying technology from end users
- Design, manage, and test disaster recovery procedures for a variety of data platforms
- Value code simplicity and performance
- Obsess over data: everything needs to be accounted for and be thoroughly tested
- Build great things alone, but the greatest things in collaboration with others
Nice to haves:
- You are deeply familiar with Spark and/or Hive
- You have expert experience with Airflow
- You understand the intricacies between different storage formats like Parquet, Avro, Arrow, and JSON
- You are familiar with deployment and configuration tools such as Kubernetes, Drone, and Terraform
- You have experience building microservices
- You’ve built an end-to-end production-grade data platform that runs on AWS
- You have experience building a machine learning platform using tools like SparkML, Tensorflow, Scikit-Learn, etc.
As a Data Platform Engineer, you will:
- Build a large-scale batch and real-time platform that will make data pipelines seamless and scalable
- Help drive best practices in continuous integration and delivery
- Help drive optimization, testing, and tooling to improve data platform quality
- Collaborate with other software engineers, machine learning experts, and stakeholders, taking learning and leadership opportunities that will arise every single day
- In three months you will have familiarized yourself with much of our data platform, be making regular contributions to our codebase, will be collaborating regularly with stakeholders to widen your knowledge and helping to resolve incidents and respond to user requests
- In six months you will have successfully investigated, scoped, executed, and documented a small to medium sized project and worked with stakeholders to make sure their data needs are satisfied by implementing improvements to our platform
- In a year you will have become the key person for several projects within the team and will have contributed to the the data platform’s roadmap. You will have made several sizable contributions to the project and are regularly looking to improve the overall stability and scalability of the architecture
*MongoDB is an equal opportunities employer*
MongoDB was founded in 2007 by Dwight Merriman, Eliot Horowitz and Kevin Ryan – the team behind DoubleClick.
At the Internet advertising company DoubleClick (now owned by Google), the team developed and used many custom data stores to work around the shortcomings of existing databases. The business served 400,000 ads per second, but often struggled with both scalability and agility. Frustrated, the team was inspired to create a database that tackled the challenges it faced at DoubleClick.
MongoDB is the leading modern, general purpose database platform, designed to unleash the power of software and data for developers and the applications they build. Headquartered in New York, with offices across North America, Europe, and Asia-Pacific, we are close to where you do business. MongoDB has more than 4,300 customers in over 85 countries, including more than half of the global Fortune 100. The MongoDB database platform has been downloaded over 30 million times and there have been more than 730,000 MongoDB University registrations.