Skip to content

api-evangelist/apache-madlib

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Apache MADlib (apache-madlib)

Apache MADlib is an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical, and machine learning methods for structured and unstructured data, executed within PostgreSQL or Greenplum Database. MADlib enables data scientists to run machine learning algorithms directly in the database using SQL.

URL: Visit APIs.json URL

Run: Capabilities Using Naftiko

Tags:

  • In-Database Analytics, Machine Learning, PostgreSQL, SQL, Statistics, Deep Learning

Timestamps

  • Created: 2026-03-16
  • Modified: 2026-04-19

APIs

Apache MADlib

MADlib provides SQL-callable functions for classification, regression, clustering, dimensionality reduction, graph analytics, time series analysis, deep learning with Keras/TensorFlow backend, and other machine learning algorithms running directly within PostgreSQL or Greenplum Database with GPU acceleration support.

Human URL: https://madlib.apache.org/docs/latest/index.html

Tags:

  • Machine Learning, PostgreSQL, SQL, Deep Learning, Statistics

Properties

Common Properties

Features

Name Description
In-Database Machine Learning Run machine learning algorithms directly within PostgreSQL or Greenplum Database using SQL, eliminating data movement overhead.
Classification and Regression Support for logistic regression, linear regression, naive Bayes, decision trees, random forests, support vector machines, and more.
Clustering Algorithms K-Means, DBSCAN, and other clustering algorithms for unsupervised learning within the database.
Deep Learning with Keras/TensorFlow Train and serve deep learning models using Keras and TensorFlow backends with GPU acceleration support.
Graph Analytics Built-in graph algorithms for network analysis, path finding, and community detection on graph data stored in the database.
Time Series Analysis ARIMA, SARIMA, and other time series forecasting models running in-database.
Dimensionality Reduction PCA and SVD implementations for dimensionality reduction and feature extraction.
Model Selection and Hyperparameter Tuning Cross-validation and hyperparameter optimization frameworks for model selection.
Association Rules FP-Growth and Apriori algorithms for market basket analysis and association rule mining.

Use Cases

Name Description
Predictive Analytics Build predictive models for churn prediction, fraud detection, and demand forecasting directly on database data.
Recommendation Systems Implement collaborative filtering and content-based recommendation algorithms using in-database machine learning.
Customer Segmentation Cluster customers using K-Means and other algorithms to identify segments for targeted marketing.
Anomaly Detection Detect anomalies in time series and transactional data using statistical models running in-database.
Network Analysis Analyze social networks, supply chains, and communication graphs using built-in graph algorithms.

Integrations

Name Description
PostgreSQL Primary execution environment supporting PostgreSQL versions 11 through 15.
Greenplum Database Native support for Greenplum Database GP6 and GP7 for massively parallel processing.
TensorFlow Deep learning backend integration for training neural networks within the database.
Keras High-level deep learning API integration for building and training models with GPU acceleration.
XGBoost Gradient boosting framework integration for high-performance tree-based models.

Maintainers

FN: Kin Lane

Email: info@apievangelist.com

About

Apache MADlib is an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical, and machine learning methods for structured and unstructured data, executed within PostgreSQL or Greenplum Database.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors