How-to guides#

Practical recipes for common tasks. Each guide assumes you have already installed condastats (see Installation).

Filter by time period #

Single month

Pass --month (CLI) or the month keyword (Python):

CLI

$ condastats overall pandas --month 2024-01

Python

overall("pandas", month="2024-01")

Date range

Pass --start_month and --end_month together:

CLI

$ condastats overall pandas --start_month 2024-01 --end_month 2024-06

Python

overall("pandas", start_month="2024-01", end_month="2024-06")

Monthly breakdown

Add --monthly to split the result by month instead of summing:

CLI

$ condastats overall pandas --start_month 2024-01 --end_month 2024-03 --monthly

Python

overall("pandas", start_month="2024-01", end_month="2024-03", monthly=True)

Group downloads by dimension #

condastats has four grouping subcommands. Each one splits download counts by a different column in the dataset:

Subcommand / function	Groups by	Example values
`pkg_platform`	Platform architecture	`linux-64`, `osx-arm64`, `win-64`
`data_source`	Channel or repository	`anaconda`, `conda-forge`
`pkg_version`	Package version	`1.5.3`, `2.0.0`
`pkg_python`	Python version used	`3.10`, `3.11`, `3.12`

All four accept the same time-filtering options (--month, --start_month/--end_month, --monthly).

CLI

$ condastats data_source pandas --month 2024-01

Python

from condastats import data_source
data_source("pandas", month="2024-01")

Filter overall downloads by multiple criteria #

The overall subcommand (and overall() function) supports additional filter flags to narrow results by platform, data source, package version, and Python version – all at once:

CLI

$ condastats overall pandas --month 2024-01 \
    --pkg_platform linux-64 \
    --data_source conda-forge \
    --pkg_python 3.11

Python

overall(
    "pandas",
    month="2024-01",
    pkg_platform="linux-64",
    data_source="conda-forge",
    pkg_python="3.11",
)

Compare multiple packages #

Pass multiple package names on the command line, or a list in Python:

CLI

$ condastats overall pandas numpy scipy --month 2024-01

Python

overall(["pandas", "numpy", "scipy"], month="2024-01")

This works with every subcommand, not just overall.

Get the raw DataFrame #

By default overall() aggregates results into a pandas.Series. Pass complete=True to get the full, unaggregated pandas.DataFrame:

df = overall("pandas", month="2024-01", complete=True)
df.head()

Tip

This is useful when you want to do custom grouping or further analysis with the full dataset.

Use condastats in Jupyter notebooks #

condastats works out of the box in Jupyter. Because every function returns a pandas object, results render as rich HTML tables:

from condastats import overall, pkg_platform

# A Series renders as a nice table in Jupyter
overall(["pandas", "numpy"], month="2024-01")

You can also call the CLI from a notebook cell with a ! prefix:

!condastats pkg_platform pandas --month 2024-01

How-to guides#

Filter by time period#

Group downloads by dimension#

Filter overall downloads by multiple criteria#

Compare multiple packages#

Get the raw DataFrame#

Use condastats in Jupyter notebooks#

Run condastats without installing#

Filter by time period #

Group downloads by dimension #

Filter overall downloads by multiple criteria #

Compare multiple packages #

Get the raw DataFrame #

Use condastats in Jupyter notebooks #

Run condastats without installing #