Skip to content

rubisco57/heatmaps

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

heatmapz - Better heatmaps in Python

Python code and Jupyter notebook for an improved heatmap implementation using Matplotlib and Seaborn.

Similar to what you can easily get in Tableau using a Size parameter, here you can have square size as parameter depending on the field value.

Described in detail in this blog post: https://towardsdatascience.com/better-heatmaps-and-correlation-matrix-plots-in-python-41445d0f2bec

See below for installation details, reference and examples.

Example of a correlation matrix

drawing

Example of a discrete joint distribution

drawing

Installation

To get the pip package run:

pip install heatmapz

Then you can use the following imports:

from heatmap import heatmap, corrplot

Reference


heatmap(x, y, **kwargs)

Parameters:

x : A list, np.array or pandas.Series containing the values for the horizontal dimension

y : A list, np.array or pandas.Series containing the values for the vertical dimension

Optional parameters:

color : A list, np.array or pandas.Series containing values based on which to apply the heatmap color. Should have the same length as x and y.

palette : A list of colors to use as the heatmap palette. The values from color are mapped onto the palette so that min(color) -> palette[0] and max(color) -> palette[len(palette)-1], and the values in between are linearly interpolated. A good way to choose or create a palette is to simply use Seaborn palettes (https://seaborn.pydata.org/tutorial/color_palettes.html).

color_range : A tuple (color_min, color_max) that enables capping the values of color being mapped to palette. All color values less than color_min are capped to color_min, and all color values larger than color_max are capped to color_max. Then those values are mapped to palette as described under color.

size : A list, np.array or pandas.Series containing values based on which to apply the size to the shapes in the plot. Should have the same length as x and y.

size_range : A tuple (size_min, size_max) that enables capping the values of size being applied to the shapes in the plot. Essentially controls min and max size of the shapes.

size_scale : Used to scale the size of the shapes in the plot to make them fit the size of the fields in the matrix. Default value is 500. You will likely need to fiddle with this parameter in order to find the right value for your figure size and the size range applied.

x_order : Should contain all distinct values of x ordered in the way you want them shown on the x-axis from left to right.

y_order : Should contain all distinct values of y ordered in the way you want them shown on the y-axis from bottom to top.

marker : Specify the shape to use in the plot. It can be any of the matplotlib marker shapes (https://matplotlib.org/api/markers_api.html). The default is 's' for square.

xlabel : Label for the x-axis. Default is none.

ylabel : Label for the y-axis. Default is none.


corrplot(data, size_scale=500, marker='s')

A shorthand function for making correlation plots from pandas dataframes.

Parameters:

data : You should pass the result of calling df.corr() on a dataframe.

size_scale : Used to scale the size of the shapes in the plot to make them fit the size of the fields in the matrix. Default value is 500. You will likely need to fiddle with this parameter in order to find the right value for your figure size and the size range applied.

marker : Specify the shape to use in the plot. It can be any of the matplotlib marker shapes (https://matplotlib.org/api/markers_api.html). The default is 's' for square.

Examples

You can find a walktrough with examples in this Colab notebook https://colab.research.google.com/drive/1YSvER-U3cwGplSHyXwaCXYhOWfWO53Iy

About

Better heatmaps in Python

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.4%
  • Python 0.6%