Python Matplotlib BarChart Plots
BarChart Plots in Jupyter Notebooks
This article describes how to create a Bar Charts from a CSV dataset file using Matplotlib. The Python code is written & executed in a Jupyter Notebook. The CSV dataset file (penguins_size.csv) was downloaded from Kaggle https://www.kaggle.com/datasets/parulpandey/palmer-archipelago-antarctica-penguin-data?resource=download.
The article describes how to create Bar Charts to display the Antarctica penguins species and compare their average Body Mass.
Load the dataset into a DataFrame
Write code to use pandas to read the CSV data into a DataFrame and then write a statement to view the data. The DataFrame GroupBy statement is used to display a Count for each species of Penguin. df_penguin.head() could have also been used to take a look at the data.
data:image/s3,"s3://crabby-images/51137/51137ca5f7dfee0ba74da61fd072766a0b66ce4a" alt="Python Data output"
Another dataframe statement to get a look at the data by Grouping the Species and displaying the Average body mass (aka the Mean).
data:image/s3,"s3://crabby-images/af9fa/af9fa58104fa27b7ad2b0ef315f422e54e7c3b26" alt="Python dataframe GroupBy statement to calculate Mean over each Penguin Species"
DataFrame & Lists for the BarChart
Create a New DataFrame - Create a New Dataframe to capture the Grouped Average body mass by Penguin Species. Reset the Index for the new DataFrame.
data:image/s3,"s3://crabby-images/3ecb8/3ecb8c8b0c03adcf0cccac0f31ab751de28c19cb" alt="Create the Grouped Dataframe"
Create Lists for the X & Y axis of the Bar Chart. Each statement will create separate Lists to be used in the matplotlib code and produce the chart.
data:image/s3,"s3://crabby-images/c8f08/c8f08d7efb22f9bd64401390d204e99171464fe6" alt="Convert dataframe to Lists"
Note:If the Index is not Reset when creating the DataFrame for df_penguins, the query for the Species as a separate List will not work.
data:image/s3,"s3://crabby-images/11808/1180805286b5bb339232362aabcbb5eee8d3ad50" alt="Index Not Reset in DataFrame"
data:image/s3,"s3://crabby-images/7874a/7874a5883619bb4a1ad4baf63b7a9331df449b48" alt="KeyError Failure when Index is Not reset"
Matplotlib BarChart - Create the code to output the chart to include the Size, Title, Y-Axis, and data for plt.bar.
plt.bar arguments are the X-axis values (penguin species names), Y-axis values (body mass averages).
data:image/s3,"s3://crabby-images/b30db/b30db2b581239f5392a3e30b3218a262e714feea" alt="Simple matplotlib bar chart code"
The Bar Chart should appear after running the code. The addition of the values for each bar would be helpful.
data:image/s3,"s3://crabby-images/68419/68419fb971f0bd21dee464a5947ad103d96cd1f2" alt="Simple matplotlib bar chart image"
Updated Version of Matplotlib BarChart - a barchart with additional information.
Write code to produce a data to extract the Body Mass average by Species & Sex for comparison.
data:image/s3,"s3://crabby-images/fcaef/fcaef6ebe85cbd86a6b7e0ba072e4dd05f49eb71" alt="Penguin Body Mass by Species & Sex"
Separate the male and female penguins into 2 data frames (df_female & df_male). Create 2 new data frames for the Average Body Mass of the male and female penguin species (df_female_penguins & df_male_penguins).
data:image/s3,"s3://crabby-images/5634a/5634afa07284d437925bdfd74306b52073ff6214" alt="2 Data Frames to split males & females + 2 new data frames with Average Body Mass"
Create 2 Lists from the data frames: df_female_penguins & df_male_penguins.
data:image/s3,"s3://crabby-images/bcaf6/bcaf6c27b536d8e17825d1af2682b14ca216cb0e" alt="Create Lists of Average Body Mass for both Data Frames"
Generate the Penguin Species List for the X axis from the data frame df_penguins. (also shown up above)
data:image/s3,"s3://crabby-images/08d24/08d247ae39af8d8e55315d89f4c6efc2fba5e600" alt="Create List of Penguin Species"
Update the 2 Lists to Round the number of digits after the decimal to 2 (to shorten the 12 digits after the decimal) for both male and females lists (m_list & f_list)
data:image/s3,"s3://crabby-images/1f862/1f862cb6506311182f01369340368ba54e363c32" alt="Rounding Body Mass to 2 digits"
Write code to produce the Bar Chart with 2 sets of bars for each species - one for the Blue bars and one for the Pink bars.
data:image/s3,"s3://crabby-images/57009/57009ef663a6d705db1a30406e2efff1b4463f3f" alt="Code for 2 bars per species"
Run the code to produce the first draft of the Bar Chart with body mass values for male and female penguins.
data:image/s3,"s3://crabby-images/04256/04256e120e9a7ad8bafbb19bd82ba86c2536d3f2" alt="Output of 2 Bars plotted for each Species"
Additional updates to the Bar Chart code will make the Bar Chart Visualization a bit cleaner by adding code to the remove some of the border lines (left, right, top), adding Values on top of each Column, and a Legend for each group.
data:image/s3,"s3://crabby-images/7e8ce/7e8ce75d60a33aa48ab7946e66a0912ef82c414a" alt="Updated Code for 2 Bars plotted for each Species"
Final Bar Chart Version
data:image/s3,"s3://crabby-images/1b33d/1b33d73ea599bf980f5215bc719329abfaa41eb3" alt="Updated Bar Chart Data Visualization of Average Body Mass for Males and Female Penguin Species"