URL Website Detection

This project builds a machine learning model that classifies a given URL as malicious or legitimate.
It is implemented entirely in a Google Colab notebook, so it runs without any local setup.

📌 Features

Colab-Ready: Run the notebook directly in Google Colab.
Dataset Preprocessing: Cleaning, tokenizing, and extracting lexical & statistical features from URLs.
Feature Engineering: Attributes include:
- URL length
- Presence of suspicious keywords
- Domain structure
- Character patterns
Model Training: Experiments with multiple algorithms:
- Logistic Regression
- Decision Tree
- Random Forest
Evaluation Metrics:
- Accuracy
- Precision
- Recall
- F1-Score
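
To make the feature list above concrete, here is a minimal sketch of lexical feature extraction using only the Python standard library. It is an illustration, not the notebook's actual code: the function name `extract_features`, the keyword list, and the exact feature names are assumptions chosen for this example.

```python
import re
from urllib.parse import urlparse

# Hypothetical keyword list, chosen for illustration only.
SUSPICIOUS_KEYWORDS = ("login", "verify", "secure", "update", "account")


def extract_features(url: str) -> dict:
    """Return a few simple lexical features for one URL (illustrative sketch)."""
    # urlparse only fills in the host when a scheme is present,
    # so prepend one for bare inputs such as "example.com/path".
    parsed = urlparse(url if "://" in url else "http://" + url)
    host = parsed.hostname or ""
    lowered = url.lower()

    # A host made of four dot-separated digit groups is treated as an IP address.
    host_is_ip = bool(re.fullmatch(r"\d{1,3}(\.\d{1,3}){3}", host))

    return {
        # URL length
        "url_length": len(url),
        # Presence of suspicious keywords
        "has_suspicious_keyword": int(any(k in lowered for k in SUSPICIOUS_KEYWORDS)),
        # Domain structure: dots beyond "name.tld" are counted as subdomain levels
        "num_subdomains": 0 if host_is_ip else max(host.count(".") - 1, 0),
        "host_is_ip": int(host_is_ip),
        # Character patterns
        "num_digits": sum(c.isdigit() for c in url),
        "num_special_chars": sum(c in "-_@?=&%" for c in url),
    }


if __name__ == "__main__":
    print(extract_features("http://secure-login.example.com/verify?id=123"))
```

Each URL becomes a fixed-size dictionary of numbers, which can be collected into a table and passed to any of the classifiers listed above.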

📂 Repository Structure

URL_website_detection/
│
├── URL.ipynb               # Main Google Colab notebook
├── dataset.zip             # Input dataset (compressed archive)
└── README.md               # Project documentation

🚀 How to Run

Open the notebook in Google Colab using the badge above.
Upload or connect the dataset.
Run each cell sequentially to:
- Preprocess the data
- Extract features
- Train models
- Evaluate results
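
For the final evaluation step, the four metrics listed under Features can be computed directly from true and predicted labels. The sketch below is a plain-Python illustration and not the notebook's code; the function name `classification_metrics` and the convention that label 1 means "malicious" are assumptions. Libraries such as scikit-learn offer equivalents (for example `sklearn.metrics.precision_recall_fscore_support`).

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall and F1 for a binary classifier.

    `positive` is the label treated as the class of interest
    (assumed here to be 1 = malicious).
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)  # true positives
    fp = sum(t != positive and p == positive for t, p in pairs)  # false positives
    fn = sum(t == positive and p != positive for t, p in pairs)  # false negatives
    correct = sum(t == p for t, p in pairs)

    # Guard every ratio against an empty denominator.
    accuracy = correct / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Toy labels for illustration only; not results from this project.
    y_true = [1, 1, 1, 0, 0, 0, 1, 0]
    y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
    print(classification_metrics(y_true, y_pred))
```

For malicious-URL detection, recall on the malicious class is often worth watching alongside accuracy, since a missed malicious URL is usually costlier than a false alarm.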

Author: Aviral Saini
Project Type: Machine Learning / URL Classification
