Blog
Thoughts on Site Reliability Engineering, automation, and building reliable systems.
The AI Skill Gap: When Code Quality Becomes Everyone's Problem
AI hasn't closed the skill gap in software engineering—it's widened it while making it invisible. How code quality suffers when engineers can generate code they can't evaluate.
Angel Operations: Equity for Execution
What if instead of raising money to hire a team, you could partner with operators who bring the team directly? Trade equity for execution. Skip the fundraising loop and start building.
Mentoring Interns: Lessons from the Other Side
What I learned about mentoring interns after being mentored myself - practical lessons from both sides of the relationship.
How I Became an SRE in One Year
From zero IT experience to working as an SRE at a large enterprise - how mentorship and self-directed learning changed my career.
Surviving a Ransomware Attack Part 2: Lessons from the Enterprise Applications Group
Hard lessons learned about legacy application recovery, backup realities, and the documentation you wish you had when everything is burning.
Surviving a Ransomware Attack: What I Learned Running a War Room for a Month
An insider's perspective on surviving a major ransomware attack and the lessons about teamwork, leadership, and resilience that came from 16-hour days in a war room.
Welcome to The SRE Project
Introducing our blog and what you can expect to find here: practical insights from two engineers who learned SRE the hard way.