A Lucid Framework for Transparent and Interpretable Machine Learning Models.

Last update: Aug 12, 2022

Overview

Currently a Beta-Version

lucidmode is an open-source, low-code and lightweight Python framework for transparent and interpretable machine learning models. It has built in machine learning methods optimized for visual interpretation of some of the most relevant calculations.

Documentation

Oficial Website: https://www.lucidmode.org
Documentation: https://lucidmode.readthedocs.io
Python Package Index (PyPI) repository: https://pypi.org/project/lucidmode/
Github repository: https://github.com/lucidmode/lucidmode

Installation

With package manager (coming soon)

Install by using pip package manager:

pip install lucidmode

Cloning repository

Clone entire github project

[email protected]:lucidmode/lucidmode.git

and then install dependencies

pip install -r requirements.txt

Models

Artificial Neural Network

Feedforward Multilayer perceptron with backpropagation.

fit: Fit model to data
predict: Prediction according to model

Initialization, Activations, Cost functions, regularization, optimization

Weights Initialization: With 4 types of criterias (zeros, xavier, common, he)
Activation Functions: sigmoid, tanh, ReLU
Cost Functions: Sum of Squared Error, Binary Cross-Entropy, Multi-Class Cross-Entropy
Regularization: L1, L2, ElasticNet for weights in cost function and in gradient updating
Optimization: Weights optimization with Gradient Descent (GD, SGD, Batch) with learning rate
Execution: Callback (metric threshold), History (Cost and metrics)
Hyperparameter Optimization: Random Grid Search with Memory

Complementary

Metrics: Accuracy, Confusion Matrix (Binary and Multiclass), Confusion Tensor (Multiclass OvR)
Visualizations: Cost evolution
Public Datasets: MNIST, Fashion MNIST
Special Datasets: OHLCV + Symbolic Features of Cryptocurrencies (ETH, BTC)

Important Links

Release notes: https://github.com/lucidmode/lucidmode/releases
Issues: https://github.com/lucidmode/lucidmode/issues
Example Notebooks: https://github.com/lucidmode/lucidmode/tree/main/notebooks
Documentation: https://lucidmode.readthedocs.io
Python Package Index (PyPI) repository: https://pypi.org/project/lucidmode/

Author/Principal Maintainer

Francisco Munnoz (IFFranciscoME) Is an associate professor of financial engineering and financial machine learning ITESO (Western Institute of Technology and Higher Education)

License

GNU General Public License v3.0

Permissions of this strong copyleft license are conditioned on making available complete source code of licensed works and modifications, which include larger works using a licensed work, under the same license. Copyright and license notices must be preserved. Contributors provide an express grant of patent rights.

Contact: For more information in reggards of this repo, please contact [email protected]

Implementations of Machine Learning models, Regularizers, Optimizers and different Cost functions.

Linear Models Implementations of LinearRegression, LassoRegression and RidgeRegression with appropriate Regularizers and Optimizers. Linear Regression

1 Nov 22, 2021

Tangram makes it easy for programmers to train, deploy, and monitor machine learning models.

Tangram Website | Discord Tangram makes it easy for programmers to train, deploy, and monitor machine learning models. Run tangram train to train a mo

1.4k Jan 5, 2023

SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker.

SageMaker Python SDK SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker. With the S

1.8k Jan 1, 2023

Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production and monitoring them after deployment to production.

25 Dec 28, 2022

easyNeuron is a simple way to create powerful machine learning models, analyze data and research cutting-edge AI.

5 Jun 18, 2022

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

Light Gradient Boosting Machine LightGBM is a gradient boosting framework that uses tree based learning algorithms. It is designed to be distributed a

14.5k Jan 7, 2023

Automated modeling and machine learning framework FEDOT

This repository contains FEDOT - an open-source framework for automated modeling and machine learning (AutoML). It can build custom modeling pipelines for different real-world processes in an automated way using an evolutionary approach. FEDOT supports classification (binary and multiclass), regression, clustering, and time series prediction tasks.

National Center for Cognitive Research of ITMO University

148 Jul 5, 2021

machine learning model deployment project of Iris classification model in a minimal UI using flask web framework and deployed it in Azure cloud using Azure app service

This is a machine learning model deployment project of Iris classification model in a minimal UI using flask web framework and deployed it in Azure cloud using Azure app service. We initially made this project as a requirement for an internship at Indian Servers. We are now making it open to contribution.

73 Dec 1, 2022

QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.

152 Jan 2, 2023

Releases(v0.4-beta1.0)

v0.4-beta1.0(Apr 29, 2021)
Metrics

Calculation of several metrics for classification sensitivity (TPR), specificity (TNR), accuracy (acc), likelihood ratio (positive), likelihood ratio (negative), confusion matrix (binary and multiclass) confusion tensor (binary for every class in multi-class)

Sequential Class

Move the cost_f and cost_r parameters to be specified from the formation method, leave the class instantiation with just the model architecture

Move the init_weights method to be specified from the formation method

Execution

Create formation method in the Sequential Class, with the following parameters init, cost, metrics, optimizer

Store selected metrics in Train and Validation History

Visualizations

Select metrics for verbose output

Source code(tar.gz)
Source code(zip)
v0.3-beta1.0(Apr 27, 2021)
Regularization:

On weights and biases, location: gradients

L1, L2 and ElasticNet

On weights and biases, location: cost function

L1, L2 and ElasticNet

Numerical Stability:

in functions.py, in cost, added a 1e-25 value to A, to avoid a divide by zero and invalid multiply cases in computations of np.log(A)

Data Handling:

train and validation cost

Visualization:

print: verbose of cost evolution

Documentation:

Improve README

Source code(tar.gz)
Source code(zip)
v0.2-beta1.0(Apr 27, 2021)
Files:

complete data set: MNIST

complete data set: 'fashion-MNIST'

Tests passed:

fashion MNIST

previous release tests

Topology

single hidden layer (tested)

1 - 2 hidden layers (tested)

different activation functions among hidden layer

Activation functions:

For hidden -> Sigmoid, Tanh, ReLU (tested and not working)

For output -> Softmax

Cost Functions:

'binary-logloss' (Binary-class Cross-Entropy)

'multi-logloss' (Multi-class Cross-Entropy)

Metrics:

Confusion matrix (Multi-class)

Accuracy (Multi-class)

Source code(tar.gz)
Source code(zip)
v0.1-beta1.0(Apr 26, 2021)
First release!

Tests passed:

Random XOR data classification

Sequential model:

hidden_l: Number of neurons per hidden layer (list of int, with a length of l_hidden)

hidden_a: Activation of hidden layers (list of str, with length l_hidden)

output_n: Number of neurons in the output layer (1)

output_a: Activation of output layer (str)

Layer transformations:

linear

Activation functions:

For hidden -> Sigmoid, Tanh

For output -> Sigmoid (Binary)

Weights Initialization:

Xavier normal, Xavier uniform, common uniform, according to [1]

Training Schemes:

Gradient Descent

Cost Functions:

Sum of Squared Error (SSE) or Residual Sum of Squares (RSS)

Metrics:

Accuracy (Binary)

Source code(tar.gz)
Source code(zip)
LucidNet_v0.1-beta1.0.zip(111.97 MB)

Owner

lucidmode

A lucid framework for interpretable machine learning models

GitHub Repository https://www.lucidmode.org

Python implementation of Weng-Lin Bayesian ranking, a better, license-free alternative to TrueSkill

Python implementation of Weng-Lin Bayesian ranking, a better, license-free alternative to TrueSkill This is a port of the amazing openskill.js package

156 Dec 14, 2022

pywFM is a Python wrapper for Steffen Rendle's factorization machines library libFM

pywFM pywFM is a Python wrapper for Steffen Rendle's libFM. libFM is a Factorization Machine library: Factorization machines (FM) are a generic approa

251 Sep 23, 2022

Model factory is a ML training platform to help engineers to build ML models at scale

Model Factory Machine learning today is powering many businesses today, e.g., search engine, e-commerce, news or feed recommendation. Training high qu

16 Sep 23, 2022

Pydantic based mock data generation

This library offers powerful mock data generation capabilities for pydantic based models. It can also be used with other libraries that use pydantic as a foundation, for example SQLModel, Beanie and

396 Dec 28, 2022

This repository has datasets containing information of Uber pickups in NYC from April 2014 to September 2014 and January to June 2015. data Analysis , virtualization and some insights are gathered here

uber-pickups-analysis Data Source: https://www.kaggle.com/fivethirtyeight/uber-pickups-in-new-york-city Information about data set The dataset contain

1 Nov 03, 2021

DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning.

DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported ha

1.1k Jan 04, 2023

Data Version Control or DVC is an open-source tool for data science and machine learning projects

Continuous Machine Learning project integration with DVC Data Version Control or DVC is an open-source tool for data science and machine learning proj

2 Jul 29, 2021

LibRerank is a toolkit for re-ranking algorithms. There are a number of re-ranking algorithms, such as PRM, DLCM, GSF, miDNN, SetRank, EGRerank, Seq2Slate.

LibRerank LibRerank is a toolkit for re-ranking algorithms. There are a number of re-ranking algorithms, such as PRM, DLCM, GSF, miDNN, SetRank, EGRer

126 Dec 28, 2022

Generate music from midi files using BPE and markov model

37 Oct 24, 2022

Toolkit for building machine learning models that generalize to unseen domains and are robust to privacy and other attacks.

Toolkit for Building Robust ML models that generalize to unseen domains (RobustDG) Divyat Mahajan, Shruti Tople, Amit Sharma Privacy & Causal Learning

149 Jan 06, 2023

Simple linear model implementations from scratch.

Hand Crafted Models Simple linear model implementations from scratch. Table of contents Overview Project Structure Getting started Citing this project

2 Sep 13, 2021

GroundSeg Clustering Optimized Kdtree

ground seg and clustering based on kitti velodyne data, and a additional optimized kdtree for knn and radius nn search

2 Dec 02, 2021

XGBoost-Ray is a distributed backend for XGBoost, built on top of distributed computing framework Ray.

92 Dec 14, 2022

Automated Machine Learning Pipeline for tabular data. Designed for predictive maintenance applications, failure identification, failure prediction, condition monitoring, etc.

10 May 15, 2022

Distributed deep learning on Hadoop and Spark clusters.

Note: we're lovingly marking this project as Archived since we're no longer supporting it. You are welcome to read the code and fork your own version

1.3k Dec 28, 2022

Python 3.6+ toolbox for submitting jobs to Slurm

Submit it! What is submitit? Submitit is a lightweight tool for submitting Python functions for computation within a Slurm cluster. It basically wraps

768 Jan 03, 2023

MCML is a toolkit for semi-supervised dimensionality reduction and quantitative analysis of Multi-Class, Multi-Label data

MCML is a toolkit for semi-supervised dimensionality reduction and quantitative analysis of Multi-Class, Multi-Label data. We demonstrate its use

26 Nov 29, 2022

Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Now updated with Dask to handle millions of rows.

Auto_TS: Auto_TimeSeries Automatically build multiple Time Series models using a Single Line of Code. Now updated with Dask. Auto_timeseries is a comp

519 Jan 03, 2023

50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster

[Due to the time taken @ uni, work + hell breaking loose in my life, since things have calmed down a bit, will continue commiting!!!] [By the way, I'm

1.4k Jan 01, 2023

A Python library for detecting patterns and anomalies in massive datasets using the Matrix Profile

matrixprofile-ts matrixprofile-ts is a Python 2 and 3 library for evaluating time series data using the Matrix Profile algorithms developed by the Keo

696 Dec 26, 2022