Skip to content

aviralhub/URL_website_detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

URL Website Detection

This project builds a machine learning model to detect whether a given URL is malicious or legitimate.
It is fully implemented in a Google Colab notebook, making it easy to run without any local setup.

πŸ“Œ Features

  • Colab-Ready: Run the notebook directly in Google Colab.
  • Dataset Preprocessing: Cleaning, tokenizing, and extracting lexical & statistical features from URLs.
  • Feature Engineering: Attributes include:
    • URL length
    • Presence of suspicious keywords
    • Domain structure
    • Character patterns
  • Model Training: Experiments with multiple algorithms:
    • Logistic Regression
    • Decision Tree
    • Random Forest
  • Evaluation Metrics:
    • Accuracy
    • Precision
    • Recall
    • F1-Score

πŸ“‚ Repository Structure

URL_website_detection/
β”‚
β”œβ”€β”€ URL.ipynb               # Main Google Colab notebook
β”œβ”€β”€ dataset.csv             # Input dataset (if included)
β”œβ”€β”€ README.md               # Project documentation

πŸš€ How to Run

  1. Open the notebook in Google Colab using the badge above.
  2. Upload or connect the dataset.
  3. Run each cell sequentially to:
    • Preprocess the data
    • Extract features
    • Train models
    • Evaluate results

Author: Aviral Saini
Project Type: Machine Learning / URL Classification

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published