
kubow/datastore


General repository about data-related technologies

Covered data storage concepts

  • High-level architecture and concepts
  • SQL
    • Relational DB (OLTP)
    • Analytical DB (OLAP)
  • NoSQL
    • Key-Value DB
    • Graph DB
    • Document DB
  • On-Premise / Cloud / Hybrid

Covered data engines *

* The following product types are included:


| Data Storing Engines | Specific Data Tools |
|---|---|
| Amazon Web Services #platform | Data Discovery |
| Amazon Athena #query | AWS Glue (data integration and catalog) |
| DynamoDB #storage #nosql | |
| Airbyte #integration | Fivetran (managed data movement) |
| Apache (multiple products) | SchemaCrawler (DB schema discovery & comprehension tool - github) |
| Apache Beam #processing | Apache NiFi (dataflow automation) |
| Apache Doris #storage #olap | |
| Apache Flink #processing #streaming | |
| Apache HBase #storage #nosql | |
| Apache Hudi #format | |
| Apache Kudu #storage #olap | |
| Apache Pinot #storage #olap | |
| Apache Solr #search | |
| AutoMQ #streaming | SodaSQL (data testing and monitoring - documentation) |
| ClickHouse #storage | |
| CockroachDB #storage | Data Processing |
| Cosmos DB #storage #nosql | Azure Data Factory (data integration) |
| Couchbase #storage #nosql | |
| CrateDB #storage | Apache Airflow (orchestration) |
| Databricks #storage #platform | Talend DataCleaner (Profiling & Cleansing) |
| DataWatch #storage | OpenRefine |
| Debezium | Meltano (data extracting) |
| Delta Lake #format | Prefect (workflow orchestration) |
| DuckDB #storage #embed | dbt (data transformation) |
| Elasticsearch #storage #search | dlt |
| Exasol #storage #olap | |
| Firestore #storage #document | Google Dataflow (stream and batch processing) |
| Google #platform | Estuary |
| IBM #platform | Kestra |
| InfluxDB #storage | Mage.AI |
| JSON (standalone / JSON native db) | y42 |
| MariaDB #storage | |
| MarkLogic #storage #nosql | |
| Milvus #storage #vector | Qdrant #storage #vector |
| MongoDB #storage #oltp | Data Analysis & Reporting (full list) |
| Microstrategy | GoodData #platform #analytics |
| Microsoft #platform | Incorta #platform #analytics |
| Minio | PowerBI #platform #analytics |
| Neo4J #storage #graph | Qlik #platform #analytics |
| OpenSearch #search | |
| Oracle #platform | SAS/STAT |
| Pentaho #etl | |
| PostgreSQL #storage | |
| Prometheus #storage #timeseries | |
| QuestDB #storage #timeseries | |
| Redis #storage | |
| RelationalAI | |
| Salesforce | |
| SAP #platform | Data Monitoring |
| ScyllaDB #storage #nosql | |
| SingleStore #storage | |
| Snowflake #storage | HP OpenView (Rep Agent compatible) |
| StarRocks #storage #olap | |
| SQLite #storage #embed | IBM Tivoli |
| Teradata #storage | Ignite |
| TigerBeetle | |
| TimescaleDB #storage #timeseries | |
| Trino #query | Temporal (durable workflow engine) |
| TDEngine | BMC |
| Vertica | Bradmark http://www.bradmark.com/ |
| Weaviate #storage #vector | |
| XML (standalone / XML native db) | |

Universal Data Clients

  • DataGrip
  • DBeaver
  • SQuirreL SQL

Universal Database tweakers

Model structure for any product

  • System
    • Architecture
    • Product fundamentals
  • Install
    • Preparation, installation
    • Editions
    • Licensing
    • Versions
    • Upgrading
  • Maintenance
    • Operational Management
    • Monitoring
    • Security
    • Backup / Recovery

Engines overview database description (sqlite3 database source)

The view "engine_overview" has columns for:

  • engine name
  • developer (language used + website)
  • engine category (and storage type)
    • basic categories (SQL, NoSQL, graph, key-value, document, time series)
    • storage types (transactional, analytical, integration, data warehousing)
  • supported operating systems (win, unix, linux, mac)
  • security rating
  • deployment model
  • sourcing model (open-source?, free version available?)
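
The column list above could map onto a SQLite view roughly as follows. This is only a minimal sketch: the table names `engine` and `engine_os`, every column name, and the `ExampleDB` row are hypothetical placeholders chosen to mirror the list, not the actual schema or data of the repository's sqlite3 source.

```python
import sqlite3

# Hypothetical schema: table and column names are assumptions that mirror
# the "engine_overview" column list; they are not the repository's schema.
SCHEMA = """
CREATE TABLE engine (
    id               INTEGER PRIMARY KEY,
    name             TEXT NOT NULL,
    developer        TEXT,
    language         TEXT,
    website          TEXT,
    category         TEXT,     -- SQL, NoSQL, graph, key-value, document, time series
    storage_type     TEXT,     -- transactional, analytical, integration, data warehousing
    security_rating  INTEGER,
    deployment_model TEXT,
    open_source      INTEGER,  -- 0 or 1
    free_version     INTEGER   -- 0 or 1
);

CREATE TABLE engine_os (
    engine_id INTEGER REFERENCES engine(id),
    os        TEXT             -- win, unix, linux, mac
);

CREATE VIEW engine_overview AS
SELECT e.name                                  AS engine_name,
       e.developer || ' (' || e.language || ', ' || e.website || ')' AS developer,
       e.category || ' / ' || e.storage_type   AS engine_category,
       group_concat(o.os, ', ')                AS supported_os,
       e.security_rating,
       e.deployment_model,
       e.open_source,
       e.free_version
FROM engine AS e
LEFT JOIN engine_os AS o ON o.engine_id = e.id
GROUP BY e.id;
"""


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with one placeholder engine row."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    # Placeholder data only; it describes no real product.
    conn.execute(
        "INSERT INTO engine VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
        (1, "ExampleDB", "Example Corp", "C", "https://example.com",
         "SQL", "transactional", 3, "on-premise", 1, 1),
    )
    conn.executemany(
        "INSERT INTO engine_os (engine_id, os) VALUES (?, ?)",
        [(1, "linux"), (1, "mac")],
    )
    conn.commit()
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    conn.row_factory = sqlite3.Row
    for row in conn.execute("SELECT * FROM engine_overview"):
        print(dict(row))
```

Keeping operating systems in a separate table and folding them back with `group_concat` is one possible design; a single comma-separated text column on `engine` would also satisfy the list above.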

Useful resources

  • Embedded databases (1): The harmony of DuckDB, KùzuDB and LanceDB | The Data Quarry
  • Embedded databases (2): KùzuDB, an extremely fast OLAP graph database | The Data Quarry

Books

  • Principles of Database Management: The Practical Guide to Storing, Managing and Analyzing Big and Small Data
  • Database in Depth: Relational Theory for Practitioners
  • What is High Availability? The Ultimate Guide | Percona
  • Seven Databases in Seven Weeks
  • Segmentation Fault - A DBA Perspective

Free sources

  • Index of /~database/documents @ University of Oklahoma
  • UI bakery sample databases
  • Datasets - Data | World Resources Institute
  • GitHub - jOOQ/sakila: The Sakila Database
  • Datasets - National Open Data Catalog (Národní katalog otevřených dat, NKOD)
  • The MONDIAL Database

Web sites

Knowledge Base of Relational and NoSQL DBMS