Automatically updated pypi API data, available in bulk via git or sqlite

The PyPI JSON Data project is a static dump of PyPI metadata, sourced directly from the API twice-daily. The data is stored in a GitHub repository, and is available for download or direct access.

The aim of the project is to provide a simple, easy-to-use dataset for PyPI metadata, without the need to query the API directly. This is useful for data analysis, or for building tools that require access to PyPI metadata.

The data can also be downloaded as a parquet or sqlite file from the GitHub releases page. Below is a snippet of the pip package’s JSON file

{
    "0.2": {
        "info": {
            "author": "The Open Planning Project",
            "author_email": "python-virtualenv@groups.google.com",
            "bugtrack_url": null,
            "classifiers": [
                "Development Status :: 4 - Beta",
                "Intended Audience :: Developers",
                "License :: OSI Approved :: MIT License",
                "Topic :: Software Development :: Build Tools"
            ],
            "description_content_type": null,
            "docs_url": null,
            "download_url": "UNKNOWN",
            "downloads": {
...