Used for data processing in machine learning, and help us to construct ML model more easily from scratch

Last update: Jul 05, 2022

Related tags

Overview

Popanda

Author: Shengxuan Wang at OSU.

Used for data processing in machine learning, and help us to construct ML model more easily from scratch. Can be used in linear model, logistic regression model, and decision tree.

The name is from "Po" in the movie Kung Fu Panda.

Can use it by:

import popanda as ppd

To use this package, please make sure you already install the latest "pandas" and "math".

Remember to put it in the same directory with your code.

11/26/2021: release

12/23/2021: add two functions using in dicision tree model

Owner

ShawnWang

GitHub Repository

Hg002-qc-snakemake - HG002 QC Snakemake

HG002 QC Snakemake To Run Resources and data specified within snakefile (hg002QC

2 Feb 16, 2022

Picka: A Python module for data generation and randomization.

Picka: A Python module for data generation and randomization. Author: Anthony Long Version: 1.0.1 - Fixed the broken image stuff. Whoops What is Picka

108 Nov 30, 2021

Stock Analysis dashboard Using Streamlit and Python

StDashApp Stock Analysis Dashboard Using Streamlit and Python If you found the content useful and want to support my work, you can buy me a coffee! Th

27 Dec 09, 2022

Sample code for Harry's Airflow online trainng course

Sample code for Harry's Airflow online trainng course You can find the videos on youtube or bilibili. I am working on adding below things: the slide p

102 Dec 30, 2022

WaveFake: A Data Set to Facilitate Audio DeepFake Detection

WaveFake: A Data Set to Facilitate Audio DeepFake Detection This is the code repository for our NeurIPS 2021 (Track on Datasets and Benchmarks) paper

27 Dec 22, 2022

CRISP: Critical Path Analysis of Microservice Traces

CRISP: Critical Path Analysis of Microservice Traces This repo contains code to compute and present critical path summary from Jaeger microservice tra

110 Jan 06, 2023

Fast, flexible and easy to use probabilistic modelling in Python.

Please consider citing the JMLR-MLOSS Manuscript if you've used pomegranate in your academic work! pomegranate is a package for building probabilistic

3k Jan 02, 2023

DaCe is a parallel programming framework that takes code in Python/NumPy and other programming languages

aCe - Data-Centric Parallel Programming Decoupling domain science from performance optimization. DaCe is a parallel programming framework that takes c

330 Dec 30, 2022

Data pipelines built with polars

valves Warning: the project is very much work in progress. Valves is a collection of functions for your data .pipe()-lines. This project aimes to host

14 Jan 03, 2023

Titanic data analysis for python

Titanic-data-analysis This Repo is an analysis on Titanic_mod.csv This csv file contains some assumed data of the Titanic ship after sinking This full

1 Dec 26, 2021

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift This project is composed of two parts: Part1 and Part2

1 Jan 19, 2022

Used for data processing in machine learning, and help us to construct ML model more easily from scratch

Related tags

Overview

Popanda

Owner

ShawnWang

Hg002-qc-snakemake - HG002 QC Snakemake

Picka: A Python module for data generation and randomization.

Stock Analysis dashboard Using Streamlit and Python

Sample code for Harry's Airflow online trainng course

WaveFake: A Data Set to Facilitate Audio DeepFake Detection

CRISP: Critical Path Analysis of Microservice Traces

Fast, flexible and easy to use probabilistic modelling in Python.

DaCe is a parallel programming framework that takes code in Python/NumPy and other programming languages

Data pipelines built with polars

Titanic data analysis for python

PrimaryBid - Transform application Lifecycle Data and Design and ETL pipeline architecture for ingesting data from multiple sources to redshift

MS in Data Science capstone project. Studying attacks on autonomous vehicles.

Flexible HDF5 saving/loading and other data science tools from the University of Chicago

MidTerm Project for the Data Analysis FT Bootcamp, Adam Tycner and Florent ZAHOUI

DefAP is a program developed to facilitate the exploration of a material's defect chemistry

Statistical package in Python based on Pandas

INFO-H515 - Big Data Scalable Analytics

Synthetic Data Generation for tabular, relational and time series data.

pyETT: Python library for Eleven VR Table Tennis data

Import, connect and transform data into Excel