Transaction Fraud Detection System - Project Documentation

Overview

This project is a full-stack fraud detection system that pairs a Python web interface with a C++ analysis engine. The engine uses a Trie, a hash map, and the KMP string-matching algorithm to detect suspicious patterns in financial transactions.

System Architecture & Data Flow

1. User Input (Frontend)

Interface: The user interacts with the dashboard (index.html).
Input Method: Transactions are entered into a text area in CSV format (e.g., 1001,500,Offshore Payment).
Trigger: Clicking "Run Detection Engine" executes the JavaScript analyzeTransactions() function.
Payload: The input text is packaged into a JSON object ({ "data": "..." }) and sent to the backend via a POST request.
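
To exercise the endpoint without the browser, the same request can be assembled from a script. The sketch below is written in Python rather than the dashboard's JavaScript. The URL assumes the default local address mentioned in the setup steps, and `build_analyze_request` is a hypothetical helper name, not part of the project.

```python
import json
import urllib.request

# Assumed local address; adjust to wherever app.py is listening.
ANALYZE_URL = "http://localhost:5000/analyze"


def build_analyze_request(csv_text, url=ANALYZE_URL):
    """Build (but do not send) the POST request carrying {"data": "..."}."""
    body = json.dumps({"data": csv_text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_analyze_request("1001,500,Offshore Payment")
# To send it once the server is running:
#     with urllib.request.urlopen(request) as response:
#         print(response.read().decode("utf-8"))
```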

2. Data Transfer (Middleware)

Server: A Python Flask server (app.py) handles the request at the /analyze endpoint.
Process Management: Python spawns the C++ executable (fraud_engine) as a subprocess using the subprocess module.
Input Pipe: The raw transaction text is piped directly into the C++ program's Standard Input (stdin).
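
The hand-off to the engine might look roughly like the following. This is a minimal sketch using only the standard `subprocess` module; `run_engine` is a hypothetical name and the real app.py may be organised differently. In the Flask handler, the `data` field of the request body would be passed as `raw_text` and the path to the compiled `fraud_engine` as the command.

```python
import subprocess
import sys


def run_engine(raw_text, command):
    """Pipe raw_text to the child's stdin and return what it prints to stdout."""
    completed = subprocess.run(
        command,
        input=raw_text,
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if the child exits non-zero
    )
    return completed.stdout


# Stand-in for ["./fraud_engine"] so the sketch runs without the compiled binary:
# a tiny child process that echoes its stdin back in upper case.
echo_command = [
    sys.executable,
    "-c",
    "import sys; sys.stdout.write(sys.stdin.read().upper())",
]
echoed = run_engine("1001,500,Offshore Payment\n", echo_command)
```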

3. C++ Analysis Engine (`fraud_engine.cpp`)

The C++ engine is the core processing unit. It runs three distinct checks, each backed by a different data structure or algorithm, to identify potential fraud.

Phase A: Parsing & Pre-processing

Reads input from stdin line-by-line.
Parses CSV data to extract ID, Amount, and Description.
Populates a frequencyMap (Hash Map) to track transaction counts per User ID.
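
The parsing phase can be illustrated in a few lines of Python (the engine itself is C++, so this is only a sketch of the same idea). Skipping malformed rows and allowing commas inside the description are assumptions made here, not documented behaviour of `fraud_engine.cpp`.

```python
from collections import Counter


def parse_transactions(raw_text):
    """Split CSV lines into (id, amount, description) tuples, skipping malformed rows."""
    transactions = []
    for line in raw_text.splitlines():
        # Split into at most 3 fields so the description may contain commas.
        parts = line.strip().split(",", 2)
        if len(parts) != 3:
            continue
        txn_id, amount_text, description = (part.strip() for part in parts)
        try:
            amount = float(amount_text)
        except ValueError:
            continue  # amount is not numeric
        transactions.append((txn_id, amount, description))
    return transactions


sample = "1001,500,Offshore Payment\n1002,20,Coffee\n1001,75,Crypto top-up\nnot a row\n"
transactions = parse_transactions(sample)

# Equivalent of the frequencyMap: ID -> number of transactions seen.
frequency_map = Counter(txn_id for txn_id, _, _ in transactions)
```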

Phase B: Detection Algorithms

1. Trie (Prefix Tree) - Blacklist Verification

Purpose: Fast lookup of known malicious IDs.
Mechanism: A Trie is pre-loaded with blacklisted IDs (e.g., "9999", "1001"). The engine traverses the tree one character at a time using the ID of the current transaction.
Advantage: Lookup cost is proportional to the length of the ID, O(L), not to the number of blacklisted entries, and the same structure supports prefix-based matching.
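
A minimal Trie in Python shows why the lookup cost depends only on the ID length. The class and method names are illustrative, and the sketch assumes exact-match semantics (an ID is flagged only if the whole ID is stored); the C++ engine may handle prefixes differently.

```python
class TrieNode:
    def __init__(self):
        self.children = {}        # next character -> TrieNode
        self.is_terminal = False  # True if a blacklisted ID ends at this node


class BlacklistTrie:
    def __init__(self, ids=()):
        self.root = TrieNode()
        for blacklisted_id in ids:
            self.insert(blacklisted_id)

    def insert(self, key):
        node = self.root
        for ch in key:
            node = node.children.setdefault(ch, TrieNode())
        node.is_terminal = True

    def contains(self, key):
        """Exact match: one node visited per character, so O(len(key))."""
        node = self.root
        for ch in key:
            node = node.children.get(ch)
            if node is None:
                return False
        return node.is_terminal


blacklist = BlacklistTrie(["9999", "1001"])
```

With this blacklist, `blacklist.contains("1001")` is true, while `"100"` (a prefix only) and `"10011"` (a longer ID) are not matched.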

2. Hash Map (Frequency Map) - Velocity Check

Purpose: Detect high-frequency transaction spamming.
Mechanism: Utilizes the pre-calculated frequencyMap. If a User ID's occurrence count exceeds a threshold (e.g., > 3), it is flagged as "High Frequency Fraud".
Advantage: Hash map lookups take O(1) time on average, so the check stays fast as the dataset grows.
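
The velocity check reduces to one counting pass followed by a threshold comparison. The sketch below uses Python's `Counter` as the hash map and the example threshold of 3 from the text; `high_frequency_ids` is a hypothetical name.

```python
from collections import Counter

FREQUENCY_THRESHOLD = 3  # example value from the text: flag when count > 3


def high_frequency_ids(ids, threshold=FREQUENCY_THRESHOLD):
    """Return the set of IDs that appear more than `threshold` times."""
    frequency_map = Counter(ids)  # single pass to count occurrences
    return {txn_id for txn_id, count in frequency_map.items() if count > threshold}


ids = ["1001", "2002", "1001", "1001", "3003", "1001", "2002"]
flagged = high_frequency_ids(ids)  # "1001" appears 4 times, "2002" only twice
```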

3. KMP (Knuth-Morris-Pratt) - Pattern Matching

Purpose: Identify suspicious keywords within transaction descriptions (e.g., "crypto", "offshore", "bet").
Mechanism: Implements the KMP algorithm, which precomputes an LPS (Longest Prefix Suffix) array for the keyword. On a mismatch, the LPS array tells the search how far the keyword can shift without re-examining text characters that have already been matched.
Advantage: Runs in O(n + m) time for a description of length n and a keyword of length m, compared with O(n * m) in the worst case for brute-force string searching.
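
The following Python sketch shows the LPS construction and the search loop. It illustrates the standard KMP algorithm and is not a transcription of the C++ code; lower-casing the description before matching is an assumption.

```python
def build_lps(pattern):
    """lps[i] = length of the longest proper prefix of pattern[:i+1] that is also its suffix."""
    lps = [0] * len(pattern)
    length = 0
    for i in range(1, len(pattern)):
        while length > 0 and pattern[i] != pattern[length]:
            length = lps[length - 1]  # fall back to the next shorter border
        if pattern[i] == pattern[length]:
            length += 1
        lps[i] = length
    return lps


def kmp_find(text, pattern):
    """Return the index of the first occurrence of pattern in text, or -1."""
    if not pattern:
        return 0
    lps = build_lps(pattern)
    matched = 0  # number of pattern characters currently matched
    for i, ch in enumerate(text):
        while matched > 0 and ch != pattern[matched]:
            matched = lps[matched - 1]  # shift pattern; the text index never moves back
        if ch == pattern[matched]:
            matched += 1
        if matched == len(pattern):
            return i - matched + 1
    return -1


SUSPICIOUS_KEYWORDS = ["crypto", "offshore", "bet"]


def suspicious_keywords_in(description):
    lowered = description.lower()  # case-insensitive matching is an assumption
    return [kw for kw in SUSPICIOUS_KEYWORDS if kmp_find(lowered, kw) != -1]
```

Because this is plain substring matching, a short keyword such as "bet" would also match inside longer words like "alphabet"; whether that is desirable depends on how the keyword list is curated.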

4. Output Generation & Rendering

Serialization: The C++ engine constructs a JSON string representing the analysis results (e.g., [{"id": "1001", "is_suspicious": true, ...}]).
Output: The JSON string is printed to Standard Output (stdout).
Response: Python captures this output and returns it to the frontend.
Visualization: JavaScript parses the response and dynamically updates the UI, highlighting suspicious transactions in red and safe ones in green based on the is_suspicious flag.
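
The shape of the exchange can be sketched in Python as follows. Only the two fields named in the example above (`id` and `is_suspicious`) are included; any further fields the real engine emits are not shown in this documentation and are left out here. `serialize_results` is a hypothetical name.

```python
import json


def serialize_results(results):
    """Turn (id, is_suspicious) pairs into a JSON array string."""
    payload = [{"id": txn_id, "is_suspicious": flag} for txn_id, flag in results]
    return json.dumps(payload)


output = serialize_results([("1001", True), ("1002", False)])

# What the frontend would see after parsing the response:
parsed = json.loads(output)
red_ids = [row["id"] for row in parsed if row["is_suspicious"]]
green_ids = [row["id"] for row in parsed if not row["is_suspicious"]]
```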

🚀 Setup & Installation

Follow these steps to set up the project locally.

1. Clone the Repository

git clone https://github.com/akaraj187/DAA_Project.git
cd DAA_Project/fraud_detection_system

2. Create a Virtual Environment

It is recommended to use a virtual environment to manage dependencies.

Mac / Linux:

python3 -m venv venv
source venv/bin/activate

Windows (Cmd/PowerShell):

python -m venv venv
venv\Scripts\activate

3. Build the Project

This step installs Python dependencies and compiles the C++ engine.

Option A: Mac / Linux (Using build script)

chmod +x build.sh
./build.sh

Option B: Windows (Manual Build)

Windows cannot run .sh files natively, so run these two commands manually:

pip install -r requirements.txt
g++ -o fraud_engine fraud_engine.cpp

🏃‍♂️ Running the Application

Once the build is complete, start the web server:

python app.py

(Note: If your main file is named something else, such as main.py or server.py, replace app.py with that name.)

The application should now be running at http://localhost:5000 (or the port shown in your terminal).
