Covered data storage concepts
- High-level architecture and concepts
- SQL
- Relational DB (OLTP)
- Analytical DB (OLAP)
- NoSQL
- Key-Value DB
- Graph DB
- Document DB
- On-Premise / Cloud / Hybrid
* The following product types are included:
- Data storage engine, referred to here as a Database engine (wikipedia.org); interchangeable with the terms "Database server" and "DBMS"
- Column-oriented https://en.wikipedia.org/wiki/List_of_column-oriented_DBMSes
- ERP as part of data storage engines https://en.wikipedia.org/wiki/Category:ERP_software
- Business Intelligence software (wikipedia.org), referred to here as a BI Tool (retrieves, analyzes, transforms and reports data)
| Data Storage Engines | Specific Data Tools |
|---|---|
| Amazon Web Services #platform | Data Discovery |
| Amazon Athena #query | AWS Glue (data integration and catalog) |
| DynamoDB #storage #nosql | |
| Airbyte #integration | Fivetran (managed data movement) |
| Apache (multiple products) | SchemaCrawler (DB schema discovery & comprehension tool - github) |
| Apache Beam #processing | Apache NiFi (dataflow automation) |
| Apache Doris #storage #olap | |
| Apache Flink #processing #streaming | |
| Apache HBase #storage #nosql | |
| Apache Hudi #format | |
| Apache Kudu #storage #olap | |
| Apache Pinot #storage #olap | |
| Apache Solr #search | |
| AutoMQ #streaming | SodaSQL (data testing and monitoring - documentation) |
| ClickHouse #storage | |
| CockroachDB #storage | Data Processing |
| Cosmos DB #storage #nosql | Azure Data Factory (data integration) |
| Couchbase #storage #nosql | |
| CrateDB #storage | Apache Airflow (orchestration) |
| Databricks #storage #platform | Talend DataCleaner (Profiling & Cleansing) |
| DataWatch #storage | OpenRefine |
| Debezium | Meltano (data extraction) |
| Delta Lake #format | Prefect (workflow orchestration) |
| DuckDB #storage #embed | dbt (data transformation) |
| Elasticsearch #storage #search | dlt |
| Exasol #storage #olap | |
| Firestore #storage #document | Google Dataflow (stream and batch processing) |
| Google #platform | Estuary |
| IBM #platform | Kestra |
| InfluxDB #storage | Mage.AI |
| JSON (standalone / JSON native db) | y42 |
| MariaDB #storage | |
| MarkLogic #storage #nosql | |
| Milvus #storage #vector | Qdrant #storage #vector |
| MongoDB #storage #oltp | Data Analysis & Reporting (full list) |
| MicroStrategy | GoodData #platform #analytics |
| Microsoft #platform | Incorta #platform #analytics |
| MinIO | Power BI #platform #analytics |
| Neo4j #storage #graph | Qlik #platform #analytics |
| OpenSearch #search | |
| Oracle #platform | SAS/STAT |
| Pentaho #etl | |
| PostgreSQL #storage | |
| Prometheus #storage #timeseries | |
| QuestDB #storage #timeseries | |
| Redis #storage | |
| RelationalAI | |
| Salesforce | |
| SAP #platform | Data Monitoring |
| ScyllaDB #storage #nosql | |
| SingleStore #storage | |
| Snowflake #storage | HP OpenView (Rep Agent compatible) |
| StarRocks #storage #olap | |
| SQLite #storage #embed | IBM Tivoli |
| Teradata #storage | Ignite |
| TigerBeetle | |
| TimescaleDB #storage #timeseries | |
| Trino #query | Temporal (durable workflow engine) |
| TDengine | BMC |
| Vertica | Bradmark http://www.bradmark.com/ |
| Weaviate #storage #vector | |
| XML (standalone / XML native db) | |
Universal Data Clients
- DataGrip
- DBeaver
- SQuirreL SQL
Universal Database tweakers
- System
- Architecture
- Product fundamentals
- Install
- Preparation, installation
- Editions
- Licensing
- Versions
- Upgrading
- Maintenance
- Operational Management
- Monitoring
- Security
- Backup / Recovery
view "engine_overview" with columns for
- engine name
- developer (language used + website)
- engine category (and storage type)
- basic categories (SQL, NoSQL, graph, key-value, document, time series)
- storage types (transactional, analytical, integration, data warehousing)
- supported operating systems (win, unix, linux, mac)
- security rating
- deployment model
- sourcing model (open-source?, free version available?)
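The "engine_overview" view outlined above could be sketched roughly as follows, using Python's built-in `sqlite3` module. This is only one possible shape: the base table `engine`, all column names, and the helper functions `build_catalog` and `overview_rows` are assumptions made for illustration and are not defined anywhere in these notes. The single PostgreSQL row is sample data, with the security rating left empty because no rating scheme is specified here.

```python
import sqlite3

# Hypothetical schema: one base table plus the "engine_overview" view.
# Table and column names are illustrative assumptions, not part of the notes.
SCHEMA = """
CREATE TABLE engine (
    name             TEXT PRIMARY KEY,
    developer        TEXT,
    language         TEXT,
    website          TEXT,
    basic_category   TEXT,     -- SQL, NoSQL, graph, key-value, document, time series
    storage_type     TEXT,     -- transactional, analytical, integration, data warehousing
    supported_os     TEXT,     -- win, unix, linux, mac
    security_rating  TEXT,     -- rating scheme is undefined in the notes, so NULL is allowed
    deployment_model TEXT,
    open_source      INTEGER,  -- 1 = yes, 0 = no
    free_version     INTEGER   -- 1 = yes, 0 = no
);

CREATE VIEW engine_overview AS
SELECT
    name AS engine_name,
    developer || ' (' || language || ', ' || website || ')' AS developer,
    basic_category || ' / ' || storage_type AS engine_category,
    supported_os,
    security_rating,
    deployment_model,
    CASE WHEN open_source = 1 THEN 'open-source' ELSE 'proprietary' END
        || CASE WHEN free_version = 1 THEN ', free version available' ELSE '' END
        AS sourcing_model
FROM engine;
"""


def build_catalog() -> sqlite3.Connection:
    """Create an in-memory catalog and load one illustrative row."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute(
        "INSERT INTO engine VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
        (
            "PostgreSQL",
            "PostgreSQL Global Development Group",
            "C",
            "https://www.postgresql.org",
            "SQL",
            "transactional",
            "win, unix, linux, mac",
            None,  # security rating intentionally left unknown
            "on-premise / cloud",
            1,
            1,
        ),
    )
    conn.commit()
    return conn


def overview_rows(conn: sqlite3.Connection) -> list[dict]:
    """Return the view's rows as dictionaries keyed by column name."""
    cur = conn.execute("SELECT * FROM engine_overview ORDER BY engine_name")
    columns = [d[0] for d in cur.description]
    return [dict(zip(columns, row)) for row in cur.fetchall()]


if __name__ == "__main__":
    for row in overview_rows(build_catalog()):
        print(row)
```

Keeping the raw attributes in a base table and deriving the display columns in the view means the combined fields (developer, engine category, sourcing model) stay consistent when a single attribute changes. Note that in this sketch a NULL language or website makes the combined `developer` column NULL, so real data would need `COALESCE` or mandatory fields.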
- Embedded databases (1): The harmony of DuckDB, KùzuDB and LanceDB | The Data Quarry
- Embedded databases (2): KùzuDB, an extremely fast OLAP graph database | The Data Quarry
- Principles of Database Management: The Practical Guide to Storing, Managing and Analyzing Big and Small Data
- Database in Depth: Relational Theory for Practitioners
- What is High Availability? The Ultimate Guide | Percona
- Seven Databases in Seven Weeks
- Segmentation Fault - A DBA Perspective
- Index of /~database/documents @ University of Oklahoma
- UI bakery sample databases
- Datasets - Data | World Resources Institute
- GitHub - jOOQ/sakila: The Sakila Database
- Datové sady (Datasets) - Národní katalog otevřených dat (NKOD, the Czech National Open Data Catalog)
- The MONDIAL Database