Plugin Directory

This is a list of known projects which install driver plugins for Intake, and the named drivers each contains:

Package Name Description Drivers CI Docs PyPi Conda
intake Builtin to Intake catalog, csv, intake_remote, ndzarr, numpy, textfiles, yaml_file_cat, yaml_files_cat, zarr_cat, json, jsonl
intake-astro Table and array loading of FITS astronomical data fits_array, fits_table
intake-accumulo Apache Accumulo clustered data storage accumulo
intake-avro Apache Avro data serialization format avro_table, avro_sequence
intake-bluesky Search and retrieve data in the bluesky data model
intake-dcat Browse and load data from DCAT catalogs dcat
intake-dremio Scan tables and send SQL queries to a Dremio server dremio
intake-duckdb Load DuckDB tables and build catalogs from DuckDB backends duckdb, duckdb_cat
intake-dynamodb Link to Amazon DynamoDB dynamodb
intake-elasticsearch Elasticsearch search and analytics engine elasticsearch_seq, elasticsearch_table
intake-esm Plugin for building and loading intake catalogs for earth system data sets holdings, such as CMIP (Coupled Model Intercomparison Project) and CESM Large Ensemble datasets
intake-geopandas Load from ESRI Shape Files, GeoJSON, and geospatial databases with geopandas geojson, postgis, shapefile, spatialite, regionmask
intake-google-analytics Run Google Analytics queries and load data as a DataFrame google_analytics_query
intake-hbase Apache HBase database hbase
intake-iris Load netCDF and GRIB files with IRIS grib, netcdf
intake-metabase Generate catalogs and load tables as DataFrames from Metabase metabase_catalog, metabase_table
intake-mongo MongoDB noSQL query mongo
intake-nested-yaml-catalog Plugin supporting a single YAML hierarchical catalog to organize datasets and avoid a data swamp nested_yaml_cat
intake-netflow Netflow packet format netflow
intake-notebook Experimental plugin to access parameterised notebooks through intake and executed via papermill ipynb
intake-odbc ODBC database odbc
intake-parquet Apache Parquet file format parquet
intake-pattern-catalog Plugin for specifying a file-path pattern which can represent a number of different entries pattern_cat
intake-pcap PCAP network packet format pcap
intake-postgres PostgreSQL database postgres
intake-s3-manifests s3_manifest
intake-salesforce Generate catalogs and load tables as DataFrames from Salesforce salesforce_catalog, salesforce_table
intake-sdmx Plugin for SDMX-compliant data sources such as BIS, ECB, ESTAT, INSEE, ILO, UN, UNICEF, World Bank and more sdmx_dataset
intake-sklearn Load scikit-learn models from Pickle files sklearn
intake-solr Apache Solr search platform solr
intake-stac Intake Driver for SpatioTemporal Asset Catalogs (STAC)
intake-stripe Generate catalogs and load tables as DataFrames from Stripe stripe_catalog, stripe_table
intake-spark Data processed by Apache Spark spark_cat, spark_rdd, spark_dataframe
intake-sql Generic SQL queries via SQLAlchemy sql_cat, sql, sql_auto, sql_manual
intake-sqlite Local caching of remote SQLite DBs and queries via SQLAlchemy sqlite_cat, sqlite, sqlite_auto, sqlite_manual
intake-splunk Splunk machine data query splunk
intake-streamz Real-time event processing using Streamz streamz
intake-thredds Intake interface to THREDDS data catalogs thredds_cat, thredds_merged_source
intake-xarray Load netCDF, Zarr and other multi-dimensional data xarray_image, netcdf, grib, opendap, rasterio, remote-xarray, zarr
intake-dataframe-catalog A searchable table of intake sources and associated metadata df_catalog

Don’t see your favorite format? See Making Drivers for how to create new plugins.

Note that if you want your plugin listed here, open an issue in the Intake issue repository and add an entry to the status dashboard repository. We also have a plugin wishlist Github issue that shows the breadth of plugins we hope to see for Intake.