β
In high-performing organizations, everyone within the team shares a common goalβquality, availability, and security arenβt the responsibility of individual departments, but are a part of everyoneβs job, every day.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Ask a programmer to review ten lines of code, heβll find ten issues. Ask him to do five hundred lines, and heβll say it looks good.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
management hints that the person guilty of committing the error will be punished. They then create more processes and approvals to prevent the error from happening again.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
By relentless and constant experimentation in their daily work, they were able to continually increase capacity, often without adding any new equipment or hiring more people.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Every company is a technology company, regardless of what business they think theyβre in. A bank is just an IT company with a banking license.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
integrating the objectives of QA and Operations into everyone's daily work reduces firefighting, hardship, and toil, while making people more productive and increasing joy in the work we do.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Cagan notes that when organizations do not pay their β20% tax,β technical debt will increase to the point where an organization inevitably spends all of its cycles paying down technical debt.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
However, interrupting technology workers is easy, because the consequences are invisible to almost everyone, even though the negative impact to productivity may be far greater than in manufacturing.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Organizations which design systems ...are constrained to produce designs which are copies of the communication structures of the organizations.. the larger an organization is, the less flexibility it has and the more pronounced the phenomenon
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Bill Baker, a distinguished engineer at Microsoft, quipped that we used to treat servers like pets: βYou name them and when they get sick, you nurse them back to health. [Now] servers are [treated] like cattle. You number them and when they get sick, you shoot them.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Broadly speaking, to acheive DevOps outcomes, we need to reduce the effects of functional orientation ("optimizing for cost") and enable market orientation ("optimizing for speed") so we can have many small teams working safely and independently, quickly delivering value to the customer
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Itβs difficult to overstate the enormity of this problemβit affects every organization, independent of the industry we operate in, the size of our organization, whether we are profit or non-profit. Now more than ever, how technology work is managed and performed predicts whether our organizations will win in the marketplace, or even survive.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Many psychologists assert that creating systems that cause feelings of powerlessness is one of the most damaging things we can do to fellow human beingsβwe deprive other people of their ability to control their own outcomes and even create a culture where people are afraid to do the right thing because of fear of punishment, failure, or jeopardizing their livelihood
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
However, interrupting technology workers is easy, because the consequences are invisible to almost everyone, even though the negative impact to productivity may be far greater than in manufacturing. For instance, an engineer assigned to multiple projects must switch between tasks, incurring all the costs of having to re-establish context, as well as cognitive rules and goals.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
high performance starts with organizations whose leadership focuses on building an environment where people from different backgrounds and with different identities, experiences, and perspectives can feel psychologically safe working together, and where teams are given the necessary resources, capacity, and encouragement to experiment and learn together in a safe and systematic way.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
DevOps benefits all of us in the technology value stream, whether we are Dev, Ops, QA, Infosec, Product Owners, or customers. It brings joy back to developing great products, with fewer death marches. It enables humane work conditions with fewer weekends worked and fewer missed holidays with our loved ones. It enables teams to work together to survive, learn, thrive, delight our customers, and help our organization succeed.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
DevOps requires potentially new cultural and management norms and changes in our technical practices and architecture. This requires a coalition that spans business leadership, Product Management, Development, QA, IT Operations, Information Security, and even Marketing, where many technology initiatives originate. When all these teams work together, we can create a safe system of work, enabling small teams to quickly and independently develop and validate code that can be safely deployed to customers. This results in maximizing developer productivity, organizational learning, high employee satisfaction, and the ability to win in the marketplace.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
In typical DevOps transformations, as we progress from deployment lead times measured in months or quarters to lead times measured in minutes, the constraint usually follows this progression: Environment creation: We cannot achieve deployments on-demand if we always have to wait weeks or months for production or test environments. The countermeasure is to create environments that are on demand and completely self-serviced, so that they are always available when we need them. Code deployment: We cannot achieve deployments on demand if each of our production code deployments take weeks or months to perform (i.e., each deployment requires 1,300 manual, error-prone steps, involving up to three hundred engineers). The countermeasure is to automate our deployments as much as possible, with the goal of being completely automated so they can be done self-service by any developer. Test setup and run: We cannot achieve deployments on demand if every code deployment requires two weeks to set up our test environments and data sets, and another four weeks to manually execute all our regression tests. The countermeasure is to automate our tests so we can execute deployments safely and to parallelize them so the test rate can keep up with our code development rate. Overly tight architecture: We cannot achieve deployments on demand if overly tight architecture means that every time we want to make a code change we have to send our engineers to scores of committee meetings in order to get permission to make our changes. Our countermeasure is to create more loosely-coupled architecture so that changes can be made safely and with more autonomy, increasing developer productivity.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
DevOps Evangelist An expert Consultant who can now evangelise a DevOps solution for an Enterprise. Typical tasks: Scaling DevOps capabilities across Enterprise Change Management Organisation Alignment This is a high-end Consulting role, with a heavy technology bend. At a DevOps Engineer level, itβs mostly about deep technical skills. As you evolve into a solution architect kind of role, along with the technical acumen, itβs also about leadership skills to build consensus/agreements and making a team work together.
β
β
Savinder Puri (How do I build a career in DevOps?: A practical handbook to help you start or scale up your career in DevOps)
β
Udemy Coursera Simplilearn Skillshare
β
β
Savinder Puri (How do I build a career in DevOps?: A practical handbook to help you start or scale up your career in DevOps)
β
Stack Overflow profile
β
β
Savinder Puri (How do I build a career in DevOps?: A practical handbook to help you start or scale up your career in DevOps)
β
GitHub Profile
β
β
Savinder Puri (How do I build a career in DevOps?: A practical handbook to help you start or scale up your career in DevOps)
β
The infrastructure that supports our continuous integration and continuous deployment processes also presents a new surface area vulnerable to attack. For instance, if someone compromises the servers running the deployment pipeline that has the credentials for our version control system, it could enable someone to steal source code. Worse, if the deployment pipeline has write access, an attacker could also inject malicious changes into our version control repository and, therefore, inject malicious changes into our application and services.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
By speeding up flow through the technology value stream, we reduce the lead time required to fulfill internal or customer requests, especially the time required to deploy code into the production environment. By doing this, we increase the quality of work as well as our throughput and boost our ability to innovate and out-experiment the competition. The resulting practices include continuous build, integration, test, and deployment processes, creating environments on demand, limiting work in process (WIP), and building systems and organizations that are safe to change.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
Three Ways: Flow, Feedback, and Continual Learning and Experimentation.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
In Operations, we may deal with this problem with the following rule of thumb: When something goes wrong in production, we just reboot the server. If that doesnβt work, reboot the server next to it. If that doesnβt work, reboot all the servers. If that doesnβt work, blame the developers, theyβre always causing outages.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Without automated testing, continuous integration is the fastest way to get a big pile of junk that never compiles or runs correctly.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Compounding the issue, the person performing the work often has little visibility or understanding of how their work relates to any value stream goals (e.g., βIβm just configuring servers because someone told me to.β). This places workers in a creativity and motivation vacuum.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
measure:9 β’deployment frequency β’deployment cycle time β’change failure rate β’development cycle time β’number of incidents β’mean time to recover (MTTR)
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
And by 2010, with the introduction of DevOps and the never-ending commoditization of hardware, software, and now the cloud, features (even entire startup companies) could be created in weeks, quickly being deployed into production in just hours or minutesβfor these organizations, deployment finally became routine and low risk.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
These organizations are able to perform experiments to test business ideas, discovering which ideas create the most value for customers and the organization as a whole, and which are then further developed into features that can be rapidly and safely deployed into production.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
Today, organizations adopting DevOps principles and practices often deploy changes hundreds or even thousands of times per day.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
In an age where competitive advantage requires fast time to market and relentless experimentation, organizations that are unable to replicate these outcomes are destined to lose in the marketplace to more nimble competitors and could potentially go out of business entirely, much like the manufacturing organizations that did not adopt Lean principles.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
regardless of what industry we are competing in, the way we acquire customers and deliver value to them is dependent on the technology value stream.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
Itβs difficult to overstate the enormity of this problemβit affects every organization, independent of the industry we operate in, the size of our organization, whether we are profit or nonprofit.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
Now more than ever, how technology work is managed and performed predicts whether our organizations will win in the marketplace or even survive.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
deliver working software frequently, from a couple of weeks to a couple of months, with a preference to the shorter timescale.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
βEvent severity: How severe was this issue? This directly relates to the impact on the service and our customers.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
How long after we knew there was a problem did it take for us to restore service?
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
deployments occur throughout the business day when everyone is already in the office and without our customers even noticingβexcept when they see new features and bug fixes that delight them.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
And, by deploying code in the middle of the workday, IT Operations is working during normal business hours like everyone else for the first time in decades.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
Instead of security and compliance activities only being performed at the end of the project, controls are integrated into every stage of daily work in the software development life cycle, resulting in better quality, security, and compliance outcomes.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
This book and the DevOps community as a whole have shown time and time again that DevOps practices and processes can take even the most legacy-riddled, old βhorseβ enterprise organization and turn it into a nimble technology organization.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
In 2021, it is clearer than ever before that every business is a technology business and every leader is a technology leader.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
Part III describes how to accelerate flow by building the foundations of our deployment pipeline: enabling fast and effective automated testing, continuous integration, continuous delivery, and architecting for low-risk releases.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
Part IV discusses how to accelerate and amplify feedback by creating effective production telemetry to see and solve problems, better anticipate problems and achieve goals,
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
Part V describes how we accelerate continual learning and experimentation by establishing a just culture, converting local discoveries into global improvements,
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
Three Ways: Flow, Feedback, and Continual Learning
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
how to properly integrate security and compliance
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
increase the success of DevOps initiatives,
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
technology value stream as the process required to convert a business hypothesis into a technology-enabled service or feature that delivers value to the customer.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
The Third Way enables the creation of a generative, high-trust culture that supports a dynamic, disciplined, and scientific approach to experimentation and risk-taking, facilitating the creation of organizational learning, both from our successes and failures.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
working in small batches (the First Way), fast feedback and monitoring (the Second Way), and a generative culture (the Third Way).
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
DevOps isnβt about automation, just as astronomy isnβt about telescopes.β2
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
While many of the DevOps patterns shown in this book require automation, DevOps also requires cultural norms and an architecture that allows for the shared goals to be achieved throughout the IT value stream.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
DevOps principles and practices are compatible with Agile,
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
Agile often serves as an effective enabler of DevOps because of its focus on small teams continually delivering high-quality code to customers.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, & Security in Technology Organizations)
β
DevOps and its resulting technical, architectural, and cultural practices represent a convergence of many philosophical and management movements (including): Lean, Theory of Constraints, Toyota production system, resilience engineering, learning organizations, safety culture, Human factors, high-trust management cultures, servant leadership, organizational change management, and Agile methods.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Stop starting. Start finishing.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
In any value stream, there is always a direction of flow, and there is always one and only one constraint; any improvement not made at that constraint is an illusion.β If
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
The deal [between product owners and] engineering goes like this: Product management takes 20% of the teamβs capacity right off the top and gives this to engineering to spend as they see fit. They might use it to rewrite, re-architect, or re-factor problematic parts of the code base...whatever they believe is necessary to avoid ever having to come to the team and say, βwe need to stop and rewrite [all our code].β If youβre in really bad shape today, you might need to make this 30% or even more of the resources. However, I get nervous when I find teams that think they can get away with much less than 20%.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
When every team expedites their work, the net result is that every project ends up moving at the same slow crawl.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
In high-performing organizations, everyone within the team shares a common goalβquality, availability, and security arenβt the responsibility of individual departments but are a part of everyoneβs job, every day.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Our goal with a product-based funding model is to value the achievement of organizational and customer outcomes, such as revenue, customer lifetime value, or customer adoption rate, ideally with the minimum of output (e.g., amount of effort or time, lines of code). Contrast this to how projects are typically measured, such as whether it was completed within the promised budget, time, and scope.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
By doing this, Development and Operations may end up creating a shared work queue, instead of each silo using a different one (e.g., Development uses JIRA while Operations uses ServiceNow). A significant benefit of this is that when production incidents
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
DevOps isnβt about automation, just as astronomy isnβt about telescopes.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
high performers use a disciplined approach to solving problems. This is in contrast to the more common practice of using rumor and hearsay, which can lead to the unfortunate metric of mean time until declared innocentβhow quickly can we convince everyone else that we didnβt cause the outage. When there is a culture of blame around outages and problems, groups may avoid documenting changes and displaying telemetry where everyone can see them to avoid being blamed for outages.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
The work done at Netflix highlights one very specific way we can use telemetry to mitigate problems before they impact our customer.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Alert fatigue is the single biggest problem we have right nowβ¦We need to be more intelligent about our alerts or weβll all go insane.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
In Operations, many of our data sets have what we call βchi squaredβ distribution. Using
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Over-alerting causes Operations engineers to be woken up in the middle of the night for protracted periods of time, even when there are few actions that they can appropriately take. The
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Chuck Rossi, Director of Release Engineering at Facebook, described, βAll the code supporting every feature weβre planning to launch over the next six months has already been deployed onto our production servers. All we need to do is turn it on.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Much of my career has involved rewrites of critical systems. You would think such a thing is easyβjust make the new one do what the old one did. Yet they are always much more complex than they seem, and overflowing with risk. The big cut-over date looms, and the pressure is on. While new features (there are always new features) are liked, old stuff has to remain. Even old bugs often need to be added to the rewritten system.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Up-front analysis helps us identify the smallest possible piece of work that will usefully achieve a business outcome using
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
To even get feedback from our integration process would require twenty-four to thirty-six hours.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
the architecture that our services operate within dictates how we test and deploy our code. This
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
creating application and infrastructure telemetry to be one of the highest return investments weβve made. In
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Gatekeeper, the Facebook feature toggling service.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
The key thing we should care about is not the form, but the outcomes: deployments should be low-risk, push-button events we can perform on demand.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
telemetry is what enables us to assemble our best understanding of reality and detect when our understanding of reality is incorrect.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Once we have centralized our logs, we can transform them into metrics by
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
We should also collect telemetry on how long it takes us to execute our builds and tests. By
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Monitoring is so important that our monitoring systems need to be more available and scalable than the systems being monitored.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
we must ensure that the applications we build and operate are creating sufficient telemetry.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
every Facebook user was part of a massive load testing program, which
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
This is the reality of operating complex systems; no single person can see the whole system and understand how all the pieces fit together.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
we need to design our systems so that they are continually creating telemetry, widely
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
However, for decades we have ended up with silos of information, where Development only creates logging events that are interesting to developers, and Operations only monitors whether the environments are up or down. As
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
When deciding whether a message should be ERROR or WARN, imagine being woken up at 4 a.m. Low printer toner is not an ERROR.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
we should ensure that all potentially significant application events generate logging entries, including
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
common practice of using rumor and hearsay, which can lead to the unfortunate metric of mean time until declared innocentβhow quickly can we convince everyone else that we didnβt cause the outage.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
should be as easy as writing one line of code to create a new metric that shows up in a common dashboard where everyone in the value stream can see it.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
It was important to us that for a developer, adding production telemetry didnβt feel as difficult as doing a database schema change.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
all team members as well as passers-by can see the latest information at a glance: count of automated tests, velocity, incident reports, continuous integration status, and so on. This
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
To have humans executing tests that should be automated is a waste of human potential.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
In addition to collecting telemetry from our production services and environments, we must also collect telemetry from our deployment pipeline when important events occur, such as when our automated tests pass or fail and when we perform deployments to any environment. We
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
Having developers share responsibility for the quality of the systems they build not only improves outcomes but also accelerates learning.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
INCREASE THE VISIBILITY OF WORK In order to be able to know if we are making progress toward our goal, itβs essential that everyone in the organization knows the current state of work. There are many ways to make the current state visible, but whatβs most important is that the information we display is up to date, and that we constantly revise what we measure to make sure itβs helping us understand progress toward our current target conditions.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)
β
What these organizations have in common is a high-trust culture that enables all departments to work together effectively, where all work is transparently prioritized and there is sufficient slack in the system to allow high-priority work to be completed quickly. This is, in part, enabled by automated self-service platforms that build quality into the products everyone is building.
β
β
Gene Kim (The DevOps Handbook: How to Create World-Class Agility, Reliability, and Security in Technology Organizations)