Getting StartedData Sources

Data Sources

Hotdata supports connecting to a wide range of databases, warehouses, lakes, and SaaS. Create a connection in your workspace to query tables from any of the sources below.

Databases

SourceDescription
PostgreSQLPostgreSQL — open-source relational database with strong SQL support and extensibility.
NeonNeon — serverless Postgres with branching and autoscaling.
SupabaseSupabase — managed Postgres, auth, storage, and APIs for applications.
MySQLMySQL — popular relational database for web and application workloads.
PlanetScalePlanetScale — serverless MySQL platform with branching and connection pooling.
DuckDBDuckDB — embedded analytical database optimized for OLAP and local analytics.
Flight SQLFlight SQL — Apache Arrow Flight SQL protocol for high-performance query transport.

Data warehouses

SourceDescription
SnowflakeSnowflake — cloud data warehouse for large-scale analytics and data engineering.
BigQueryGoogle BigQuery — serverless data warehouse for petabyte-scale analytics.
MotherDuckMotherDuck — serverless DuckDB in the cloud, compatible with local DuckDB.

Data lakes

SourceDescription
IcebergApache Iceberg — open table format for large-scale data lakes (REST or AWS Glue catalog).
DuckLakeDuckLake — DuckDB integration with Apache Iceberg for lakehouse workloads.
TigrisTigris — S3-compatible object storage (buckets via S3-style credentials).

Productivity

SourceDescription
AirtableAirtable — low-code database and collaboration platform for bases, grids, and automations.
CalendlyCalendly — scheduling and calendar integration for meetings and appointments.
CodaCoda — docs-as-apps platform combining documents, spreadsheets, and apps.
Google SheetsGoogle Sheets — cloud spreadsheets with collaboration and Sheets API access.
LinearLinear — issue tracking and project management for software teams.
MondayMonday — work OS for project management, workflows, and team collaboration.
NotionNotion — all-in-one workspace for notes, wikis, databases, and project management.
SlackSlack — messaging and collaboration for teams, channels, and integrations.
LatticeLattice — people management and performance platform.
LumaLuma — event and community platform.
n8nn8n — workflow automation and self-hosted Zapier alternative.

Data and analytics

SourceDescription
CensusCensus — reverse ETL and data activation from warehouse to business tools.
DaskDask — parallel computing library for analytics at scale via distributed DataFrames.
DatadogDatadog — observability platform for metrics, logs, traces, and APM.
SentrySentry — error tracking and performance monitoring for applications.
dbt Clouddbt Cloud — transformation layer and orchestration for the modern data stack.
EnigmaEnigma — public data platform for commercial and government datasets.
MetabaseMetabase — open-source BI and analytics for self-service dashboards and queries.
New RelicNew Relic — full-stack observability for applications, infrastructure, and logs.
SigmaSigma — cloud analytics and BI with spreadsheets interface for data exploration.
StatsigStatsig — experimentation and feature flags platform for product teams.
Sumo LogicSumo Logic — cloud-native SIEM and log analytics for security and DevOps.
AirNowAirNow — EPA air quality and air pollution data API.
BeamBeam — data integration and ETL platform.
CrunchbaseCrunchbase — company, investor, and funding data.
EppoEppo — experimentation and causal inference platform.
FireboltFirebolt — cloud data warehouse for analytics.
FiservFiserv — financial services and payment processing data.
FusegraphFusegraph — data platform and integration.
Grants.govGrants.gov — US federal grants and funding opportunities.
MarimoMarimo — reactive Python notebooks for data science.
Market APIMarket API — market and financial data.
MapboxMapbox — maps, geocoding, and location APIs.
MyParcelMyParcel — shipping and logistics for e-commerce.
Octopus EnergyOctopus Energy — energy and utility data.
OpenCorporatesOpenCorporates — global company and corporate data.
PropertyDataPropertyData API — real estate and property information.
RegridRegrid — nationwide parcel, addressing, and land grid data.
Sample CSVSample CSV — sample CSV datasets for testing (no auth).
YelpYelp — local business reviews and recommendations.
ZillowZillow — real estate listings and property data.

Development & ops

SourceDescription
CircleCICircleCI — CI and CD platform for building, testing, and deploying applications.
CodecovCodecov — code coverage reporting and analytics for test quality.
CloudflareCloudflare — DNS, CDN, security, and edge network services.
ConvexConvex — reactive backend with real-time database and serverless functions.
CriblCribl — observability pipeline for routing, transforming, and controlling data.
CrowdStrikeCrowdStrike — endpoint protection, threat intelligence, and incident response.
CursorCursor — AI-powered code editor built on VS Code.
GitHubGitHub — source code hosting, version control, and software collaboration.
JiraJira — issue tracking and project management for agile teams (Atlassian).
LaunchDarklyLaunchDarkly — feature flags and experimentation platform.
OktaOkta — workforce and customer identity, SSO, and access management.
OneLoginOneLogin — single sign-on and unified identity for applications.
RailwayRailway — deployment platform for applications and databases.
TectonTecton — feature platform for production ML feature stores.
TemporalTemporal — durable workflow execution for microservices orchestration.
Apache KafkaApache Kafka — distributed event streaming platform.
BugsnagBugsnag — error monitoring and application stability.
SnykSnyk — developer-first application security and dependency scanning.
Aikido SecurityAikido Security — cloud security posture and vulnerability management.
DrataDrata — compliance automation for SOC 2, HIPAA, and ISO.
GhostinspectorGhostinspector — automated browser testing and monitoring.
Incident.ioIncident.io — incident management and response platform.
InstatusInstatus — status page and downtime monitoring.
VantaVanta — security compliance and automation.

Customer and sales

SourceDescription
ApolloApollo — sales intelligence and engagement platform for prospecting.
ClayClay — data enrichment and prospecting for sales and recruiting.
HubSpotHubSpot — CRM, marketing, sales, and customer service platform.
NetSuiteNetSuite — cloud ERP for finance, inventory, and operations.
TwilioTwilio — programmable messaging, voice, and customer engagement APIs.
ZendeskZendesk — customer service software for support tickets and help centers.

HR and recruiting

SourceDescription
AshbyAshby — recruiting OS, scheduling, and candidate pipeline for high-growth teams.
BambooHRBambooHR — HRIS, onboarding, and employee records for growing companies.
DeelDeel — global payroll, compliance, and contractor management.
GreenhouseGreenhouse — applicant tracking, structured hiring, and recruiting analytics.
HiBobHiBob — HR, time off, and people analytics for modern mid-size teams.
LeverLever — recruiting ATS, nurture, and reporting for talent teams.
NamelyNamely — HR, payroll, and benefits for mid-market organizations.
PersonioPersonio — HR, recruiting, and payroll for European businesses.
RemoteRemote — global HR, payroll, and employer of record services.
WorkableWorkable — recruiting and applicant tracking for hiring teams.

Finance & billing

SourceDescription
BrexBrex — corporate cards, spend management, and business banking.
ChargebeeChargebee — subscription billing, invoicing, and revenue operations.
MercuryMercury — banking and treasury for startups and growing companies.
PlaidPlaid — financial account linking, auth, and open banking data.
RampRamp — corporate cards, bill pay, and expense automation.
StripeStripe — payments, billing, and financial infrastructure for the internet.
XeroXero — cloud accounting, payroll, and small-business finance.
ZuoraZuora — subscription monetization, billing, and revenue recognition.
SourceDescription
ClioClio — cloud practice management for law firms.
DocuSignDocuSign — electronic signatures and agreement lifecycle management.
Dropbox SignDropbox Sign — e-signatures and document workflows (HelloSign).
IroncladIronclad — contract lifecycle management and workflow automation.
LitifyLitify — legal practice management and intake on Salesforce.

AI and ML

SourceDescription
CohereCohere — enterprise AI platform for embeddings and generation.
ChromaChroma — embedding database and retrieval for LLM applications.
Exa AIExa AI — neural search for semantic similarity and retrieval.
LangDBLangDB — database for AI applications and vector search.
LangGraphLangGraph — framework for building stateful, multi-actor AI agents.
LangSmithLangSmith — observability and debugging for LLM applications.
MilvusMilvus — open-source vector database for similarity search and AI.
ModalModal — serverless GPU and compute for ML workloads.
PineconePinecone — managed vector database for embeddings and retrieval.
QdrantQdrant — vector similarity search engine for ML applications.
WeaviateWeaviate — open-source vector database with hybrid and semantic search.

Blockchain and web3

SourceDescription
AlgorandAlgorand — blockchain and cryptocurrency data.
BitcoinBitcoin — blockchain and cryptocurrency data.
Ocean ProtocolOcean Protocol — decentralized data exchange.
Open EthereumOpen Ethereum — Ethereum blockchain data.

Search & content

SourceDescription
AlgoliaAlgolia — hosted search API for applications and e-commerce.
FirecrawlFirecrawl — web scraping and content extraction API.