url_classifier / README.md
obaidtambo's picture
Update README.md
a131435 verified
metadata
license: mit
library_name: transformers
pipeline_tag: text-classification
tags:
  - URL
  - BinaryCalssification
  - URL Classification
  - BERT URL calssifier
widget:
  - text: torgi.kz/help/id2.txt
  - text: vvps.ws/44/mothersdarlingcross.php?
  - text: >-
      torrentreactor.net/torrents/897644/ANGELCORPSE-4-(All)-Studio-Albums%5EWaPo

πŸ”’πŸ€– Binary URL Classifier with BERT

Welcome to our Binary URL Classifier! πŸŽ‰

What it Does?

This classifier categorizes URLs into two categories: Good (Non-Malicious) and Bad (Malicious). It uses BERT, a powerful language model, to understand the content and context of URLs for accurate predictions.

How it Works?

  1. Data: Trained on a curated dataset of good and bad URLs from this repo https://github.com/faizann24/Using-machine-learning-to-detect-malicious-URLs/tree/master?tab=readme-ov-file.

  2. Training: Fine-tuned a pre-trained BERT model using Hugging Face's Transformers library.

  3. Evaluation: Validation(15%) and Test(15%) set kept Aside.

  4. Inference: Provides probability scores for new URLs, indicating their likelihood of being good or bad.

Usage πŸš€

  1. Input: Provide a URL to classify.

  2. Output: Receive a probability score for its safety.

Why Trust Us? 🌟

  • Robust Training: Trained on diverse and representative datasets.

  • Evaluation: Rigorously tested for accuracy and reliability.

  • Simple & Effective: Empowers users to make informed decisions about URL safety.

Let's Make the Web Safer! 🌐

Our Binary URL Classifier with BERT is here to enhance online safety. Whether you're a security professional, developer, or concerned user, it helps identify malicious URLs for proactive protection.

Try it now and experience the power of BERT in URL classification! We're open to feedback and suggestions for continuous improvement. Stay safe online! πŸ”’πŸ›‘οΈ