Article

Canary Testing

August 15, 2023

The goal of canary testing is to detect and address issues or bugs in the new version before it is released to a wider audience.

The goal of canary testing is to detect and address issues or bugs in the new version before it is released to a wider audience.

What is Canary testing?

Canary testing is a type of software testing that involves deploying a new version of an application or system to a small subset of users or servers in a production environment before rolling it out to the entire system. The goal of canary testing is to detect and address issues or bugs in the new version before it is released to a wider audience.

Canary testing works by selecting a small subset of users or servers, known as canaries, to receive the new version of the application or system. The canaries are typically chosen based on specific criteria, such as their location, behavior, or role, and are closely monitored for any issues or anomalies. If issues are detected, the new version can be rolled back or fixed before it affects the wider system.

Where does the term 'canary testing' come from?

The term "canary testing" is derived from the practice of using canaries in coal mines to detect toxic gasses. Canaries are particularly sensitive to carbon monoxide and other poisonous gasses, so they were often used as an early warning system for miners. If the canaries stopped singing or showed other signs of distress, it was a sign that there was a dangerous buildup of gasses in the mine.

Similarly, in software development, canary testing refers to the practice of deploying a new version of an application or system to a small group of users or servers first, in order to detect any issues or bugs before rolling it out to the wider user base. The small group of users or servers act as the "canaries," and any issues detected during the testing process can be addressed before the new version is deployed to the wider system.

Is it a popular methodology?

Canary development is a well-known and widely used methodology in software development, particularly for web applications and services. Many large companies, such as Google, Amazon, and Netflix, have successfully implemented Canary deployments to reduce the risk of issues and improve the quality of their software releases.

What are the advantages of Canary testing?

  1. Early detection of issues: Canary testing can detect issues in a new version of an application or system before it is rolled out to the wider user base, allowing them to be addressed before they impact a large number of users.
  2. Reduced risk: By testing the new version with a small group of users or servers first, canary testing reduces the risk of introducing bugs or other issues that could negatively impact the wider system.
  3. Improved stability: Canary testing can lead to a more stable and reliable product by identifying and addressing issues early in the development process.
  4. Faster deployment: Once issues have been detected and addressed during canary testing, the new version can be deployed to the wider user base more confidently and quickly.
  5. Increased confidence: Canary testing can give developers and stakeholders greater confidence in the stability and reliability of the new version, reducing the risk of costly rollbacks or delays in deployment.

What are the disadvantages of Canary testing?

The "underwater rocks" of canary testing refer to the hidden or unexpected issues that can arise during the deployment process despite the use of canary testing. These can include:

  1. Canaries not being representative of the wider user base: If the canaries do not represent the wider user base, issues that are present in the wider user base may not be detected during canary testing.
  2. Insufficient monitoring: If the canaries are not closely monitored during the testing process, issues may not be detected and may spread to the wider system.
  3. Inadequate rollback procedures: If issues are detected during canary testing, there must be adequate rollback procedures in place to ensure that the new version can be rolled back without causing disruption to the wider system.
  4. Slow rollout: If the rollout of the new version is too slow, it can lead to delays in delivering new features and improvements to users.
  5. Unforeseen interactions: Canary testing may not detect unforeseen interactions between the new version and other components of the system, which can cause issues in the wider system.
  6. Infrastructure costs: Canary testing requires additional infrastructure to be set up and maintained, which can add to the costs of the deployment.
  7. Overconfidence: There is a risk that the success of canary testing can lead to overconfidence in the new release. This can lead to a failure to adequately test the new version or to assume that issues will be caught during canary testing.

To mitigate these risks, it is important to carefully plan and execute the canary testing process, and to monitor the canaries closely during the rollout process. The canaries should be carefully selected to represent the wider user base, and adequate rollback procedures should be in place in case issues are detected. It is also important to consider other testing techniques, such as A/B testing and feature flagging, to further reduce the risk of issues in the wider system.

Is Canary testing suitable for all types of companies?

Canary testing may not be suitable for all companies or all types of applications or systems. Some factors to consider when deciding whether canary testing is appropriate for a given company or product include:

  1. Complexity: Canary testing may be more appropriate for complex applications or systems that are more likely to have bugs or issues. Simple applications or systems may not require the additional testing that canary testing provides.
  2. Size of user base: If a company has a very small user base, canary testing may not be necessary. However, if the user base is larger, canary testing can be a valuable tool for identifying issues before they impact a large number of users.
  3. Resources: Canary testing requires additional resources, such as additional servers or infrastructure, as well as the resources to monitor and analyze the results of the testing. Companies should ensure that they have the necessary resources to support canary testing before implementing it.
  4. Frequency of releases: Canary testing may be more appropriate for companies that release updates or new versions frequently, as it can help to ensure that each release is thoroughly tested before being rolled out to the wider user base.
  5. Risk tolerance: Companies with a low risk tolerance may find canary testing to be a valuable tool for minimizing the risk of issues or bugs in new releases. However, companies with a higher risk tolerance may not see the value in the additional testing that canary testing provides.

Conclusion

In conclusion, canary testing is a useful technique for testing new versions of applications or systems before rolling them out to the wider user base. It allows companies to detect and address issues early in the development process, reducing the risk of costly rollbacks or delays in deployment. However, it is important to consider factors such as complexity, size of user base, resources, frequency of releases, and risk tolerance before implementing canary testing, as it may not be suitable for all companies or products. Overall, canary testing can be a valuable tool for ensuring the stability and reliability of new releases and improving the user experience.

About the author

author photo: Tamas Cser

Tamas Cser

FOUNDER & CTO

Tamas Cser is the founder, CTO, and Chief Evangelist at Functionize, the leading provider of AI-powered test automation. With over 15 years in the software industry, he launched Functionize after experiencing the painstaking bottlenecks with software testing at his previous consulting company. Tamas is a former child violin prodigy turned AI-powered software testing guru. He grew up under a communist regime in Hungary, and after studying the violin at the University for Music and Performing Arts in Vienna, toured the world playing violin. He was bitten by the tech bug and decided to shift his talents to coding, eventually starting a consulting company before Functionize. Tamas and his family live in the San Francisco Bay Area.

Author linkedin profile