Data Engineer

Cerebri AI, Austin, TX

Turn data into revenue

Cerebri AI, a venture-backed pioneer in artificial intelligence and machine learning, is the creator of Cerebri Values™, the industry’s first universal measure of customer success. Cerebri Values quantifies each customer’s commitment to a brand or product and dynamically predicts “Next Best Actions” at scale, which enables large companies to focus on accelerating profitable growth. Deployed as a SaaS application running on Microsoft Azure, Cerebri Values operates behind the corporate firewall, ensuring the highest level of security and safeguarding personal information. Headquartered in Austin with offices in Toronto and Washington, DC, the company has over 50 employees who have been awarded over 130 patents to date. To learn more, visit cerebriai.com.
Role: Design, develop and build out data pipelines to ingest data into our proprietary data structures, and be a key collaborator in the data discovery and exploratory analysis process during our client engagements.


  • Build pipelines to ingest and maintain complex data sets into Cerebri AI’s proprietary data stores for use in machine learning modeling
  • Develop and maintain data ontologies for key market segments
  • Collaborate with data scientists to perform exploratory data analysis and to map data fields into proprietary data stores and to find signals in client data
  • Collaborate with clients to develop pipeline infrastructure, and to ask appropriate questions to gain deep understanding of client data
  • Write quality documentation on the discovery process and software projects
  • Work equally well in a team environment and on your own.
  • Communicate complex ideas clearly with both team members and clients
  • Travel up to 25%


  • At least one (1) year of experience designing and building data processing solutions and ETL pipelines for varied data formats, ideally at a company that leverages machine learning models
  • At least two (2) years of experience in SQL, Python, Apache Spark, pyspark
  • Experience working directly with relational database structures and flat files  
  • Ability to write efficient database queries, functions and views to include complex joins and the identification and development of custom indices
  • Knowledge of professional software engineering practices and best practices for the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, continuous integration and development, and operations.
  • Good verbal and written communication skills, with both technical and non-technical stakeholders

Nice to Haves

  • Experience in Java and/or Scala
  • Experience with data management processing tools such as Kafka, Elasticsearch and Logstash
  • Experience with NoSQL distributed databases such as Cassandra.
  • Experience in business intelligence visualization tools such as Grafana, Superset, Redash or Tableau.
  • Experience with Microsoft Azure or similar cloud computing solutions
  • Master’s degree or higher in a relevant quantitative subject

About Cerebri AI

Cerebri AI provides AI and machine learning solutions to help enterprises grow top line revenues by giving them a 1:1 relationship with their customers. We do this by processing internal and external customer data, and by determining the dollar value a customer places on the “value” of a vendor, products, assets, etc. We also monetize a critical variable in any revenue situation, the customer’s ability to pay, so things such as up-selling opportunities can be clearly scoped and delivered. We call the results Customer Value Indexes (CVIs) for brands, vendors, assets and financing.

Cerebri AI

Want to learn more about Cerebri AI? Visit Cerebri AI's website.