Pareto Chart 101: Visualizing the 80-20 Rule | Mode (2024)

Pareto Chart 101: Visualizing the 80-20 Rule | Mode (1)

Joel Carron, Data Scientist at Mode

You've probably heard of the Pareto principle, even if you didn't realize it at the time. Here are a few examples:

  • 80% of web traffic comes from 20% of your site's pages.

  • 80% of customer complaints are caused by 20% of product defects.

  • 80% of revenue comes from 20% of your customers.

Starting to sound familiar? The Pareto principle (also known as the 80-20 rule) applies to a wide range of fields from natural science to sports, but it's particularly suited to solving business problems, like those above. Conducting Pareto analysis allows you to discern the vital few from the trivial many and prioritize actions.

The history of the Pareto principle

In the 19th century, Vilfredo Pareto observed that 20% of the pea pods in his garden contained 80% of the peas. Pareto also happened to be an economist, and the pea pod discovery got him thinking. In 1896, he published a paper demonstrating that around 80% of the land in Italy was owned by 20% of the population.

Later, Joseph M. Juran came across Pareto's work and realized how relevant this idea was to quality management issues. As an engineer, he had noticed that a small number of defects caused the bulk of issues. He christened the pattern the Pareto principle. Since then the term has been used to describe any phenomenon where a small percentage of factors is responsible for a large percentage of the effect.

The Pareto chart

The Pareto chart is a visual representation of the 80-20 rule, featuring a bar + line chart. The bars represent the value of each item on your list (arranged in descending order), and the line indicates the cumulative percentage of those values.

Visualizing a Pareto analysis of your data allows you to quickly pick out where you should focus your time and resources. Take, for example, this dataset of consumer complaints submitted to the Consumer Financial Protection Bureau (a government agency working to protect consumers in the financial sector). Each complaint is bucketed into one of 94 distinct issue types, which might be too many for the CFPB to focus their policy efforts on all at once. Looking at a Pareto chart of consumer complaints will help them figure out where to start.

Click on the image above to see the full-size chart. To see the complete Python notebook generating this Pareto Chart, click here.

The CFPB's consumer complaint distribution follows the Pareto principle to a T. The top 20% of issues generate just over 80% of all complaints. This means that the CFPB can effectively work to address 80% of concerns by focusing on only the top 19 issues (compared to the total 94).

Not every distribution you come across will follow the 80-20 rule as closely as the CFPB's issue types. The Pareto principle assumes a long tail distribution, and not every dataset will follow that exact pattern. That said, a Pareto chart is an effective way of evaluating potential impact when selecting a subset of focus areas from a long list, regardless of the distribution.

Make your own Pareto chart

You can make your own Pareto chart for any list of things that have values associated with them (e.g. a list of blog post URLs and the number of web sessions for each, a list of accounts and the revenue they bring in). Check out the code in Mode and follow these steps:

Step 1: Pull data with SQL

Use a query like this one that pulls values for the items on your list. Note that a COUNT works here because you're counting complaints submitted. For numerical or financial data, you'll likely want to use a SUM instead.

SELECT issue, COUNT(DISTINCT complaint_id) as total FROM modeanalytics.consumer_complaintsGROUP BY 1ORDER BY 2 DESC
Step 2: Plot in Python with Plotly

Next, copy this code into your Python notebook.

Before you create the plot, you need to make a new column for the cumulative percentage, which is what you'll use for the line in your chart.

df['cumulative_sum'] = df.total.c*msum()df['cumulative_perc'] = 100*df.cumulative_sum/df.total.sum()

In this example, the DataFrame columns are named issue and total. If your columns are called something else, you'll need to update the Python code where applicable.

Optional Step 3: Customize HTML

The report page of this example also uses some custom HTML. The first code block sets the background color of the report page to match the background color used for the plot.

<style>body {background: rgb(240, 240, 240) !important;}.chart {background-color: rgb(240, 240, 240);border: none;}.mode-embed.mode-object {border: none;background: rgb(240, 240, 240);}.js-table-container {background-color: rgb(240, 240, 240);}</style>

The second code block increases the height of the chart area on the report page.

<script>$("#python_52c4404842c0 .chart-content").css("height",700)$("#python_52c4404842c0 .mode-python").css("height",700)$("#python_52c4404842c0 .mode-python").css("max-height",700)$("#python_52c4404842c0 img").css("max-height",700)</script>

Simply update the #python_ tags to match the chart id, which you'll find in a line that looks like this:

<mode-python id="python_52c4404842c0"options="python_options"></mode-python>

For more information on customizing the HTML of Mode reports, click here. And for more custom examples and inspiration, be sure to check out the Mode Gallery.

Additionally, here are some other tools and tutorials for making your own Pareto chart:

Recommended articles
  • 10 Useful Python Data Visualization Libraries for Any Discipline

  • Thinking in SQL vs Thinking in Python

  • 11 Data Experts Who Will Constantly Inspire You

Pareto Chart 101: Visualizing the 80-20 Rule | Mode (2024)

FAQs

How do you interpret the 80-20 rule in Pareto chart? ›

The Pareto principle states that for many outcomes, roughly 80% of consequences come from 20% of causes. In other words, a small percentage of causes have an outsized effect. This concept is important to understand because it can help you identify which initiatives to prioritize so you can make the most impact.

How do you visualize the 80-20 rule? ›

The Pareto chart is a visual representation of the 80-20 rule, featuring a bar + line chart. The bars represent the value of each item on your list (arranged in descending order), and the line indicates the cumulative percentage of those values.

What is an example of the 80-20 rule Pareto? ›

So, here are some Pareto 80 20 rule examples:
  • 20% of criminals commit 80% of crimes.
  • 20% of drivers cause 80% of all traffic accidents.
  • 80% of pollution originates from 20% of all factories.
  • 20% of a companies products represent 80% of sales.
  • 20% of employees are responsible for 80% of the results.
Mar 3, 2017

How do you calculate Pareto 80-20? ›

If 80% of 80% of business comes from 20% of the 20% of the customers, it's (0.80 x 0.80) / (0.20 x 0.20). This means that 64% of business comes from 4% of the customers. That is 80/20 squared or (80/20)2.

What is an example of the 80-20 rule at work? ›

For business sales, 20% of a company's repeat customers should be responsible for 80% of the sales. Also, 20% of the employees are responsible for 80% of the results. For project management, the first 20% of the effort put in on a project should yield 80% of the project's results.

How do you interpret data in a Pareto chart? ›

The left vertical axis of the Pareto chart has "counts" or "cost" depending on the data used. Each vertical bar represents the contribution to the total from a given "problem" area. The bars are placed on the graph in rank order, that is the bar at the left has the highest contribution to counts or cost.

What is the best chart to show 80-20 rule? ›

The Pareto Chart is a very powerful tool for showing the relative importance of problems. It contains both bars and lines, where individual values are represented in descending order by bars, and the cumulative total of the sample is represented by the curved line.

What is the 80-20 rule often called the Pareto Principle? ›

The Pareto principle, also known as the 80/20 rule, is a theory maintaining that 80 percent of the output from a given situation or system is determined by 20 percent of the input. The principle doesn't stipulate that all situations will demonstrate that precise ratio – it refers to a typical distribution.

What is the 80-20 rule quizlet? ›

The Pareto principle (also known as the 80-20 rule, the law of the vital few, and the principle of factor sparsity) states that, for many events, roughly 80% of the effects come from 20% of the causes. From a business vantage, "80% of your sales come from 20% of your clients".

What is Pareto 80-20 rule quotes? ›

The quote "Simplicity, clarity, and focus are the hallmarks of efficiency and effectiveness" perfectly encapsulates the essence of the 80/20 principle. This principle, also known as the Pareto Principle, suggests that 80% of the results are derived from 20% of the causes.

What is the most productive way to apply the 80-20 rule? ›

Prioritize the first 20% of your workday regarding the tasks you complete and know when it's time to pivot and make changes when working on the remaining 80% to ensure you don't waste too much productive time and energy.

What is the 80-20 rule in Pareto analysis inventory? ›

80-20 Inventory Rule Definition

The 80/20 inventory rule states that 80% of your profits should come from 20% of your inventory. The rule is based on the Pareto Principle, a management consulting principle that suggests that 80% of effects come from 20% of causes.

What is the purpose of the Pareto analysis or 80-20 rule? ›

The 80-20 rule, also known as the Pareto Principle, is a familiar saying that asserts that 80% of outcomes (or outputs) result from 20% of all causes (or inputs) for any given event. In business, a goal of the 80-20 rule is to identify inputs that are potentially the most productive and make them the priority.

What is the Pareto formula? ›

Collect the raw data, including the category (cause of a problem) and their count. Calculate the percentage of each category and further compute the cumulative percent. The percent can be calculated using the formula =(C3/$C$13) *100, applied throughout the other cells.

What are the interpretations of 80/20 rule Pareto analysis in testing? ›

According to the Pareto Principle, in any group of things that contribute to a common effect, a relatively few contributors account for the majority of the effect. Commonly, it is found that: 80% of complaints come from 20% of customers. 80% of sales come from 20% of clients.

What is the 80 20 rule in Pareto analysis inventory? ›

80-20 Inventory Rule Definition

The 80/20 inventory rule states that 80% of your profits should come from 20% of your inventory. The rule is based on the Pareto Principle, a management consulting principle that suggests that 80% of effects come from 20% of causes.

What does the Pareto Principle suggest 20% is to action as 80% is to? ›

The Pareto Principle is a concept that suggests that 80% of the end results of an action are due to 20% of causes. That's why this principle is also called the 80/20 rule.

What is the 80 20 rule in identifying key tasks? ›

Simply put, the 80/20 rule states that the relationship between input and output is rarely, if ever, balanced. When applied to work, it means that approximately 20 percent of your efforts produce 80 percent of the results.

References

Top Articles
Latest Posts
Article information

Author: Msgr. Refugio Daniel

Last Updated:

Views: 6212

Rating: 4.3 / 5 (54 voted)

Reviews: 85% of readers found this page helpful

Author information

Name: Msgr. Refugio Daniel

Birthday: 1999-09-15

Address: 8416 Beatty Center, Derekfort, VA 72092-0500

Phone: +6838967160603

Job: Mining Executive

Hobby: Woodworking, Knitting, Fishing, Coffee roasting, Kayaking, Horseback riding, Kite flying

Introduction: My name is Msgr. Refugio Daniel, I am a fine, precious, encouraging, calm, glamorous, vivacious, friendly person who loves writing and wants to share my knowledge and understanding with you.