pybabynames
data:image/s3,"s3://crabby-images/cd905/cd905e0a2ca7bdcc1e24610cd29a025951ccf9ef" alt="License"
Python port of the R data package babynames
. This package provides US baby names data from the Social Security Administration (SSA). It contains all names used for at least 5 children of either sex in the United States. The package features the ability to switch between the data being imported as a Polars DataFrame (default) or a Pandas DataFrame by setting an environment variable.
[!NOTE]
Please note that the pybabynames
package is a community-driven initiative and is not affiliated with Posit, Tidyverse, or the main babynames R package.
Its evolution and maintenance stem solely from the collective efforts of community members.
Installation
Install this library using pip
into an environment that already has either Pandas or Polars installed.
pip install pybabynames
Missing Pandas or Polars? You can install these packages using:
pip install polars
pip install pandas
Usage
import pybabynames as bn
babynames = bn.babynames
applicants = bn.applicants
births = bn.births
lifetables = bn.lifetables
[!IMPORTANT]
By default, we'll attempt to use the polars
module. You can switch back to using pandas
by
specifying before babynames
import statement an environment flag like so:
import os
os.environ["DATAFRAME_FRAMEWORK"] = "pandas"
import pybabynames as bn
Development
To contribute to this library, first checkout the code. Then create a new virtual environment:
cd pybabynames
python -m venv venv
source venv/bin/activate
Now install the dependencies and test dependencies:
python -m pip install -e '.[test]'
To run the tests:
python -m pytest
Acknowledgement
This Python package is a port of the R Data package babynames
by Hadley Wickham.