By Sarah Pounders
Posted: Updated:
0 Comments

Emilie Schario, data analyst at GitLab, Inc. gave a presentation on her wealth of knowledge about DevOps for Data Engineering.

Overview:

Every change to the web app creates a review app and quality tests. Why are the standards for data teams not the same? At GitLab, they’re adopting DataOps, applying the best practices of the DevOps lifecycle to data, furthering the premise the analytics is a subfield of software engineering. In this presentation, Emilie will share the merge-request-first workflow they’ve adopted at GitLab, and its effects on the business. The entire analytics stack, from ELT to visualization, is version controlled. Any changes are done in merge requests for testability and accountability. Every merge request has its own clone of the data warehouse so that there are no discrepancies between development and production results. Through open source tool dbt, all transformations in the data warehouse are version controlled, and documentation is created and stored. All of the ELT jobs, tests, and builds are orchestrated by GitLab CI. Utilizing these processes has enabled a 3 person data team to support the data needs of a billion dollar company.

Slides

Connect with Emilie

Related

We hosted Dr. Gordon Jones at the Labs in March 2019 to hear his unique and qualified perspective...

Looking for something fun AND educational for kids to do this summer?  We got you covered!  Check...

TechSav hosted a Lunch Function 101: Startups in Savannah: Challenges, Opportunities & Successes in...

No Comment
 
  1. Pingback: February 2019 Goals | She Does Better

Comments are closed.

%d bloggers like this: