Exploring Machine Learning Models for detecting anomalous behavior in credit-card transactions. It's crucial that credit-card companies are able to recognize fraudulent activity so that customers are not charged for items they didn't purchase.

Last update: Nov 17, 2022

Overview

Credit Card Fraud Detection

Came across this mocked-up dataset of customer transactions in the [Capital One Recruitment Challenge](https://github.com/CapitalOneRecruiting/DS).
The unbalanced dataset is composed of artificial customer transactions with a few outlier cases where fraud was detected. Only ~1.6% of the cases are fraudulent.
Our primary goal is to predict whether a transaction is fraudulent and, as in most sensitive classification problems, to avoid Type-II errors (missed fraud) as much as possible. We'll also try not to point accusatory fingers at genuine transactions (Type-I errors) 😂 .
The secondary goal is to identify interesting anomalies in the transactions, like multi-swipes and reversals of suspicious transactions, by performing exploratory data analysis.
Most numerical fields seem to follow power-law distributions rather than Gaussian distributions.
We'll engineer some time-dependent categorical features by parsing the datetime fields, exclude the fields that have just one categorical value (makes no sense keeping these around 😒 ), and also create a new feature to indicate whether the credit-card CVV was entered incorrectly.
Baseline classifiers chosen are Logistic Regression, SVM, Random Forest, and Isolation Forest.
Performance is kinda poor on these baseline models: accuracy, precision, and recall vary greatly across the models.
Moving on to gradient-boosting models: LightGBM (Light Gradient Boosting Machine) is known to perform well on sparse datasets.
Final accuracy achieved hovers around 98%, and recall is approximately 99.99%, indicating that false negatives are minimal.
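Since only ~1.6% of the transactions are fraudulent, accuracy on its own says little, which is why precision and recall are tracked alongside it. Here's a minimal sketch of that point in plain Python. The labels are synthetic (16 fraud cases in 1,000 transactions) and the two sets of predictions are made up for illustration; none of this is the project's actual data, models, or results.

```python
from dataclasses import dataclass


@dataclass
class ConfusionCounts:
    tp: int = 0  # fraud correctly flagged
    fp: int = 0  # genuine transaction wrongly flagged (Type-I error)
    fn: int = 0  # fraud missed (Type-II error)
    tn: int = 0  # genuine transaction correctly passed


def confusion_counts(y_true, y_pred):
    """Tally the four outcomes, treating label 1 as fraud."""
    counts = ConfusionCounts()
    for actual, predicted in zip(y_true, y_pred):
        if actual and predicted:
            counts.tp += 1
        elif predicted:
            counts.fp += 1
        elif actual:
            counts.fn += 1
        else:
            counts.tn += 1
    return counts


def summarize(counts):
    """Accuracy, precision, and recall for the fraud class."""
    total = counts.tp + counts.fp + counts.fn + counts.tn
    flagged = counts.tp + counts.fp
    actual_fraud = counts.tp + counts.fn
    return {
        "accuracy": (counts.tp + counts.tn) / total,
        "precision": counts.tp / flagged if flagged else 0.0,
        "recall": counts.tp / actual_fraud if actual_fraud else 0.0,
    }


# Synthetic labels (assumption): 16 fraud cases in 1,000 transactions (~1.6%).
y_true = [1] * 16 + [0] * 984

# Hypothetical "model" A: never flags anything.
always_genuine = [0] * 1000
baseline = summarize(confusion_counts(y_true, always_genuine))

# Hypothetical "model" B: catches 15 of the 16 fraud cases,
# but also flags 30 genuine transactions.
y_pred = [1] * 15 + [0] * 1 + [1] * 30 + [0] * 954
detector = summarize(confusion_counts(y_true, y_pred))

print("always genuine:", baseline)
print("detector:      ", detector)
```

The do-nothing model scores 98.4% accuracy while catching no fraud at all (recall 0), whereas the second one has lower accuracy (96.9%) but a recall of 93.75%. So an accuracy figure near 98% on this kind of dataset only means something when read together with recall and precision.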

Owner

Vikrant Deshpande
