Data Sources

Hotdata connects to a wide range of databases, data warehouses, data lakes, and SaaS applications. Create a connection in your workspace to query tables from any of the sources below. Each source is listed with its display name and its connection type identifier.

To list the connection types available for your workspace, run:

hotdata connection-types list

Example output:

name        label
postgres    PostgreSQL
mysql       MySQL
snowflake   Snowflake
bigquery    Google BigQuery
duckdb      DuckDB
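If you want to use this listing from a script, the output can be parsed into a lookup table. The following is a minimal sketch, assuming the CLI prints the whitespace-aligned `name` / `label` table shown above and that `hotdata` is on your PATH. The helper names `parse_connection_types` and `list_connection_types` are hypothetical and not part of Hotdata.

```python
import subprocess

# Sample text copied from the example output above.
SAMPLE_OUTPUT = """\
name        label
postgres    PostgreSQL
mysql       MySQL
snowflake   Snowflake
bigquery    Google BigQuery
duckdb      DuckDB
"""


def parse_connection_types(text: str) -> dict[str, str]:
    """Parse whitespace-aligned `name label` rows into {name: label}."""
    rows = [line for line in text.splitlines() if line.strip()]
    types: dict[str, str] = {}
    for line in rows[1:]:  # skip the header row
        # Split on the first run of whitespace only, so labels such as
        # "Google BigQuery" keep their internal space.
        parts = line.split(None, 1)
        types[parts[0]] = parts[1].strip() if len(parts) > 1 else ""
    return types


def list_connection_types() -> dict[str, str]:
    """Run the CLI and parse its output.

    Assumption: the command prints the table format shown above.
    """
    result = subprocess.run(
        ["hotdata", "connection-types", "list"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_connection_types(result.stdout)


if __name__ == "__main__":
    types = parse_connection_types(SAMPLE_OUTPUT)
    print(types["bigquery"])
```

Parsing the sample text rather than calling the CLI keeps the example runnable without a workspace. Swap in `list_connection_types()` once the CLI is installed and authenticated.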

Databases

Source      Type        Description
PostgreSQL  postgres    PostgreSQL: open-source relational database with strong SQL support and extensibility.
MySQL       mysql       MySQL: popular relational database for web and application workloads.
DuckDB      duckdb      DuckDB: embedded analytical database optimized for OLAP and local analytics.
Flight SQL  flight_sql  Flight SQL: Apache Arrow Flight SQL protocol for high-performance query transport.

Data warehouses

Source      Type        Description
Snowflake   snowflake   Snowflake: cloud data warehouse for large-scale analytics and data engineering.
BigQuery    bigquery    Google BigQuery: serverless data warehouse for petabyte-scale analytics.
MotherDuck  motherduck  MotherDuck: serverless DuckDB in the cloud, compatible with local DuckDB.

Data lakes

Source    Type      Description
Iceberg   iceberg   Apache Iceberg: open table format for large-scale data lakes (REST or AWS Glue catalog).
DuckLake  ducklake  DuckLake: open lakehouse format from the DuckDB project that keeps table metadata in a SQL database and data in Parquet files.

Productivity

Source         Type      Description
Airtable       airtable  Airtable: low-code database and collaboration platform for bases, grids, and automations.
Calendly       calendly  Calendly: scheduling and calendar integration for meetings and appointments.
Coda           coda      Coda: docs-as-apps platform combining documents, spreadsheets, and apps.
Google Sheets  gsheets   Google Sheets: cloud spreadsheets with collaboration and Sheets API access.
Linear         linear    Linear: issue tracking and project management for software teams.
Monday         monday    Monday: work OS for project management, workflows, and team collaboration.
Notion         notion    Notion: all-in-one workspace for notes, wikis, databases, and project management.
Slack          slack     Slack: messaging and collaboration for teams, channels, and integrations.
Lattice        lattice   Lattice: people management and performance platform.
Luma           luma      Luma: event and community platform.
n8n            n8n       n8n: workflow automation and self-hosted Zapier alternative.

Data and analytics

Source          Type             Description
Census          census           Census: reverse ETL and data activation from warehouse to business tools.
Dask            dask             Dask: parallel computing library for analytics at scale via distributed DataFrames.
Datadog         datadog          Datadog: observability platform for metrics, logs, traces, and APM.
dbt Cloud       dbt_cloud        dbt Cloud: transformation layer and orchestration for the modern data stack.
Enigma          enigma           Enigma: public data platform for commercial and government datasets.
Metabase        metabase         Metabase: open-source BI and analytics for self-service dashboards and queries.
New Relic       newrelic         New Relic: full-stack observability for applications, infrastructure, and logs.
Sigma           sigma            Sigma: cloud analytics and BI with a spreadsheet interface for data exploration.
Statsig         statsig          Statsig: experimentation and feature flags platform for product teams.
Sumo Logic      sumologic        Sumo Logic: cloud-native SIEM and log analytics for security and DevOps.
AirNow          airnow           AirNow: EPA air quality and air pollution data API.
Beam            beam             Beam: data integration and ETL platform.
Crunchbase      crunchbase       Crunchbase: company, investor, and funding data.
Eppo            eppo             Eppo: experimentation and causal inference platform.
Firebolt        firebolt         Firebolt: cloud data warehouse for analytics.
Fiserv          fiserv           Fiserv: financial services and payment processing data.
Fusegraph       fusegraph        Fusegraph: data platform and integration.
Grants.gov      grants_gov       Grants.gov: US federal grants and funding opportunities.
Marimo          marimo           Marimo: reactive Python notebooks for data science.
Market API      market_api       Market API: market and financial data.
MyParcel        myparcel         MyParcel: shipping and logistics for e-commerce.
Octopus Energy  octopus_energy   Octopus Energy: energy and utility data.
OpenCorporates  open_corporates  OpenCorporates: global company and corporate data.
PropertyData    property_data    PropertyData API: real estate and property information.
Sample CSV      sample_csv       Sample CSV: sample CSV datasets for testing (no auth).
Yelp            yelp             Yelp: local business reviews and recommendations.
Zillow          zillow           Zillow: real estate listings and property data.

Development and ops

Source          Type            Description
CircleCI        circleci        CircleCI: CI/CD platform for building, testing, and deploying applications.
Codecov         codecov         Codecov: code coverage reporting and analytics for test quality.
Convex          convex          Convex: reactive backend with real-time database and serverless functions.
Cribl           cribl           Cribl: observability pipeline for routing, transforming, and controlling data.
Cursor          cursor          Cursor: AI-powered code editor built on VS Code.
GitHub          github          GitHub: source code hosting, version control, and software collaboration.
Jira            jira            Jira: issue tracking and project management for agile teams (Atlassian).
LaunchDarkly    launchdarkly    LaunchDarkly: feature flags and experimentation platform.
Railway         railway         Railway: deployment platform for applications and databases.
Tecton          tecton          Tecton: feature platform for production ML feature stores.
Temporal        temporal        Temporal: durable workflow execution for microservices orchestration.
Apache Kafka    apache_kafka    Apache Kafka: distributed event streaming platform.
Bugsnag         bugsnag         Bugsnag: error monitoring and application stability.
Drata           drata           Drata: compliance automation for SOC 2, HIPAA, and ISO.
Ghostinspector  ghostinspector  Ghostinspector: automated browser testing and monitoring.
Incident.io     incident_io     Incident.io: incident management and response platform.
Instatus        instatus        Instatus: status page and downtime monitoring.
Vanta           vanta           Vanta: security compliance and automation.

Customer and sales

Source   Type     Description
Apollo   apollo   Apollo: sales intelligence and engagement platform for prospecting.
Clay     clay     Clay: data enrichment and prospecting for sales and recruiting.
HubSpot  hubspot  HubSpot: CRM, marketing, sales, and customer service platform.
Zendesk  zendesk  Zendesk: customer service software for support tickets and help centers.

AI and ML

Source     Type       Description
Cohere     cohere     Cohere: enterprise AI platform for embeddings and generation.
Exa AI     exa_ai     Exa AI: neural search for semantic similarity and retrieval.
LangDB     langdb     LangDB: database for AI applications and vector search.
LangGraph  langgraph  LangGraph: framework for building stateful, multi-actor AI agents.
LangSmith  langsmith  LangSmith: observability and debugging for LLM applications.
Milvus     milvus     Milvus: open-source vector database for similarity search and AI.
Modal      modal      Modal: serverless GPU and compute for ML workloads.
Pinecone   pinecone   Pinecone: managed vector database for embeddings and retrieval.
Qdrant     qdrant     Qdrant: vector similarity search engine for ML applications.

Blockchain and web3

Source          Type            Description
Algorand        algorand        Algorand: blockchain and cryptocurrency data.
Bitcoin         bitcoin         Bitcoin: blockchain and cryptocurrency data.
Ocean Protocol  ocean_protocol  Ocean Protocol: decentralized data exchange.
Open Ethereum   open_ethereum   Open Ethereum: Ethereum blockchain data.

Search and content

Source     Type       Description
Algolia    algolia    Algolia: hosted search API for applications and e-commerce.
Firecrawl  firecrawl  Firecrawl: web scraping and content extraction API.