Free

Break It till You Make It: Ideas on Site Reliability Engineering

Event Information

Share this event

Date and Time

Location

Location

Indeed

10721 Domain Drive

Austin, TX 78758

View Map

Event description

Description

On Wednesday, August 21, 2019 at 5:30pm, Indeed will host a Tech Talk in our new Austin office in The Domain. We’ll feature three talks that explore thoughtful approaches to site reliability.

Talks

How I Solved a Complex Equation and Broke Indeed

During his ten years in various roles at Indeed, Ketan Gangatirkar, VP of engineering, has wreaked havoc on the company’s site. In his talk, he’ll showcase creative problem-solving that resulted in unintended and negative consequences. He’ll share the specific lessons learned from each of these adventures. Ketan’s experiences all support the abiding principle that incidents occurring in very complex systems seldom share a single root cause.


How Post-Incident Reviews Help Indeed Deliver Constant Change Safely

Site reliability engineering manager, Alex Elman, will use a recent incident at Indeed to demonstrate the benefits of the post-incident review. Many high-profile events are associated with a seemingly innocuous change. A single change, however, rarely causes an incident alone. By conducting thorough reviews, organizations can learn a lot about how their systems respond to failure. Applying these lessons helps organizations can increase the capacity of their systems to adapt and absorb change.


Why SLOs Are Useful: Scaling an Organization from First Principles

Tristan Slominski, site reliability engineering manager, constantly strives to offer an answer to the question “Why are we doing this?” His talk is a distillation of models that seem to explain why certain known practices work. We understand that "two pizza" teams are about the right size. We know APIs are "good." We adopt SLOs as "good." But did you know that we can explain the effectiveness of these three standards through a single equation commonly referred to as the Universal Scalability Law? They’re all solutions to the problem of managing complexity at different organizational scales.


About the Speakers

Ketan Gangatirkar :: VP, Software Engineering for Job Seeker Products

For the last 10 years, Ketan has helped millions of people get jobs. He has broken Indeed's site in dozens of different and creative ways over the years and has finally learned what not to do. For a time, he was responsible for the Site Reliability Engineering organization at Indeed. He helped the company evolve from centralized operations to a faster, more independent, and more scalable model, so that people like Ketan can't break the site anymore.


Tristan Slominski :: Site Reliability Engineering Manager

Tristan is passionate about design, development, and operation of self-directed teams and decentralized, distributed systems. His past roles include staff software engineer, chief technology officer, and head of product development. Tristan is a former army aviator who served combat tours in Afghanistan and Iraq.


Alex Elman :: Site Reliability Engineering Manager

Alex has studied and practiced resilience engineering at Indeed for seven years. His goal: reduce failure within distributed systems to a boring nonevent. Even after moving into a leadership role, Alex continues to carry a pager because he believes that incident response is always a valuable learning opportunity.



Food and drinks provided. Doors at 5:30. Talk begins at 6:00.

10721 Domain Drive, Austin, TX 78758

We'll see you there!

Share with friends

Date and Time

Location

Indeed

10721 Domain Drive

Austin, TX 78758

View Map

Save This Event

Event Saved