By Sarah Pounders
Posted: Updated:
0 Comments

Emilie Schario, data analyst at GitLab, Inc. gave a presentation on her wealth of knowledge about DevOps for Data Engineering.

Overview:

Every change to the web app creates a review app and quality tests. Why are the standards for data teams not the same? At GitLab, they’re adopting DataOps, applying the best practices of the DevOps lifecycle to data, furthering the premise the analytics is a subfield of software engineering. In this presentation, Emilie will share the merge-request-first workflow they’ve adopted at GitLab, and its effects on the business. The entire analytics stack, from ELT to visualization, is version controlled. Any changes are done in merge requests for testability and accountability. Every merge request has its own clone of the data warehouse so that there are no discrepancies between development and production results. Through open source tool dbt, all transformations in the data warehouse are version controlled, and documentation is created and stored. All of the ELT jobs, tests, and builds are orchestrated by GitLab CI. Utilizing these processes has enabled a 3 person data team to support the data needs of a billion dollar company.

Slides

Connect with Emilie

Related

See Walter Woods’ talk from June 2019 at the Labs in Savannah, Georgia sponsored by the Creative...

We all know Python is the best language. But you may be surprised that it’s not because of the easy,...

We hosted Dr. Gordon Jones at the Labs in March 2019 to hear his unique and qualified perspective...

No Comment
 
  1. Pingback: February 2019 Goals | She Does Better

Comments are closed.

%d bloggers like this: