Principal Data Engineer (remote)
ThoughtWorks, New York City, New York
Creative Technology Consultants
Are you at your most vibrant when you’ve successfully distilled data into its simplest, most meaningful form?
Thoughtworks is a global software consultancy with an aim to create a positive impact on the world through technology. Our community of technologists thinks disruptively to deliver pragmatic solutions for our clients' most complex challenges. We are curious minds who come together as collaborative and inclusive teams to push boundaries, free to be ourselves and make our mark in tech.
As consultants, we work with our clients to ensure we’re evolving their technology and empowering adaptive mindsets to meet their business goals. You could influence the digital strategy of a retail giant, build a bold new mobile application for a bank or redesign platforms using event sourcing and intelligent data pipelines. You will learn to use the latest Lean and Agile thinking, create pragmatic solutions to solve mission-critical problems and challenge yourself every day.
Data Engineers develop modern data architecture approaches to meet key business objectives and provide end-to-end data solutions. You might spend a few weeks with a new client on a deep technical review or a complete organizational review, helping them to understand the potential that data brings to solve their most pressing problems. On other projects, you might be acting as the architect, leading the design of technical solutions, or perhaps overseeing a program inception to build a new product. It could also be a software delivery project where you're equally happy coding and tech-leading the team to implement the solution.
You’ll spend time on the following:
- Take the needs and challenges of a client and formulate the technical roadmap and technology solution that will support their business strategies and goals.
- Provide architectural recommendations, solution and approach given trade offs and ability to communicate that to the business.
- Quickly gain an understanding of the landscape of tools and data frameworks so as to recommend next steps, approach (such as real-time streaming, batch, workflows, etc.)
- Credentialize roadmap and architecture
- Enhance Data Engineering capability through coaching, mentoring and leadership
- Co-create and shape strategy and approach to engagements to achieve the desired business outcomes
- Collaborate with other Data Engineer Anchors in the org to learn and share best practices and techniques
- Provide technical leadership in an enterprise environment to ensure delivery of exceptional technical solutions.
- Mentor on approach and execution of solutions, coach on technologies and establishing a team-wide comprehension of solution capabilities and direction.
- Ensure technical expectations of deliverables are met.
- Drive Thought-Leadership on engineering and architectural practices and standards.
- Be an inspiration for innovation to the client.
- Become a trusted and valued partner of the client CIO/CTO and team.
- Maintaining strong expertise and knowledge of current and emerging technologies and products.
- Code! We don’t subscribe to the “post-technical” ivory tower leadership style.
- Assess current state of an organization
Here’s what we’re looking for:
- You are equally happy coding and leading a team to implement a solution
- You have a track record of innovation and expertise in Data Engineering
- You’re passionate about craftsmanship and have applied your expertise across a range of industries and organizations
- You have a deep understanding of data modeling and experience with data engineering tools and platforms such as Kafka, Spark, and Hadoop
- You have built large-scale data pipelines and data-centric applications using Big Data tooling like Hadoop, Spark, Hive, Oozie, and Airflow in a production setting.
- You’ve tackled challenges of persisting, working with, and exposing metadata from data engineering processes using tools such as Apache Atlas, Cloudera Navigator, etc.
- Hands-on experience building Data Engineering tooling with the Microsoft Azure Data Engineering and Analytics stacks including ADLS, Azure Synapse Analytics, Polybase, ADF, Azure Event Hub, Azure Databricks, Active Directory, and PowerBI.
- Hands-on experience with event streaming with modern event streaming tooling like Pulsar, Kafka, Kinesis. Understanding of when streaming vs. batch processing is appropriate, and tradeoffs in a given context
- Hands-on experience with MPP query engines like Presto, Dremio, and Spark SQL.
- You are comfortable applying data security strategy to solve business problems
- You are able to contrast the use of managed services vs custom built ones
A few important things to know:While we’ve traditionally been a traveling consultancy, travel is not required for this role at the moment. We anticipate the need for travel to our client locations in the future when it’s deemed safe.
Not quite ready to apply? Or maybe this isn’t the right role for you? That’s OK, you can stay in touch with AccessThoughtworks, our learning community (click "contact me about recruitment opportunities" to hear about jobs in the future).
It is the policy of Thoughtworks, Inc. to provide a work environment free of discrimination. The Company will take affirmative action to ensure applicants and Thoughtworks employees are treated without regard to race, color, religion, sex/gender, national origin, ethnic origin, veteran or military status, family or marital status, disability, genetic information, age, sexual orientation, gender expression or gender identity. This also includes individuals who are perceived to have any of the aforementioned attributes. Thoughtworks will adhere to all federal, state, and municipal laws and regulations governing employment.
A community of passionate individuals whose purpose is to revolutionize software design, creation and delivery, while advocating for positive social change.
Want to learn more about ThoughtWorks? Visit ThoughtWorks's website.
High-quality tools for hosting, sharing, and streaming videos