Skip to content

This web application allows users to extract clean, formatted text content from any website by simply entering a URL.

Notifications You must be signed in to change notification settings

filippodanesi/web-scraper

Repository files navigation

Web Content Scraper

Extract and analyze text content from any website.

Project Overview

This web application allows users to extract clean, formatted text content from any website by simply entering a URL. It's useful for research, analysis, or content aggregation without the clutter of website styling and formatting.

Features

  • Simple URL input for scraping web content
  • Clean, formatted text output
  • Minimalist black and white design
  • Dark mode support
  • API key management for Firecrawl service

Technologies Used

  • React
  • TypeScript
  • Tailwind CSS
  • shadcn/ui components
  • Firecrawl API for web scraping

Getting Started

  1. Clone this repository
  2. Install dependencies: npm install
  3. Start the development server: npm run dev
  4. Open your browser and navigate to the local development URL

API Key Setup

This project uses the Firecrawl API for web scraping. You'll need to:

  1. Sign up for a Firecrawl API key at https://firecrawl.dev
  2. Enter your API key in the application settings
  3. Start scraping web content

Development

Available Scripts

  • npm run dev - Start the development server
  • npm run build - Build the project for production
  • npm run preview - Preview the production build locally

About

This web application allows users to extract clean, formatted text content from any website by simply entering a URL.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages