general-mar11

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("Thang203/general-mar11")

topic_model.get_topic_info()
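Once loaded, the model can also be queried one topic at a time. The following is a minimal sketch, assuming BERTopic's `get_topic` method, which returns `(word, score)` pairs for a topic ID, ordered by weight. The helper names `top_keywords` and `load_model` are hypothetical and not part of BERTopic.

```python
def top_keywords(topic_model, topic_id, n=5):
    """Return the first n words of one topic (hypothetical helper)."""
    # BERTopic.get_topic returns a list of (word, score) pairs,
    # or False when the topic ID does not exist in the model.
    pairs = topic_model.get_topic(topic_id)
    if not pairs:
        return []
    return [word for word, _score in pairs[:n]]


def load_model(repo_id="Thang203/general-mar11"):
    """Load the model from the Hugging Face Hub (needs bertopic and network access)."""
    # Imported lazily so that top_keywords has no hard dependency on bertopic.
    from bertopic import BERTopic
    return BERTopic.load(repo_id)
```

For example, `top_keywords(load_model(), 2)` would list the leading words of topic 2, the visual and multimodal topic.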

Topic overview

  • Number of topics: 20 (topics 0 to 18, plus the outlier topic -1)
  • Number of training documents: 12146
Topic ID | Topic Keywords | Topic Frequency | Label
-1 | models - language - llms - language models - large | 11 | -1_models_language_llms_language models
0 | models - language - language models - model - llms | 3795 | 0_models_language_language models_model
1 | code - language - llms - models - large | 3252 | 1_code_language_llms_models
2 | visual - image - models - multimodal - video | 1236 | 2_visual_image_models_multimodal
3 | detection - text - models - attacks - adversarial | 724 | 3_detection_text_models_attacks
4 | bias - models - biases - llms - gender | 610 | 4_bias_models_biases_llms
5 | medical - clinical - models - language - llms | 451 | 5_medical_clinical_models_language
6 | legal - financial - sentiment - models - language | 437 | 6_legal_financial_sentiment_models
7 | ai - chatgpt - generative - design - generative ai | 377 | 7_ai_chatgpt_generative_design
8 | privacy - data - private - models - federated | 274 | 8_privacy_data_private_models
9 | students - education - chatgpt - ai - student | 201 | 9_students_education_chatgpt_ai
10 | driving - autonomous - autonomous driving - traffic - spatial | 164 | 10_driving_autonomous_autonomous driving_traffic
11 | protein - molecular - materials - chemical - drug | 113 | 11_protein_molecular_materials_chemical
12 | reinforcement learning - reinforcement - learning - rl - policy | 103 | 12_reinforcement learning_reinforcement_learning_rl
13 | math - mathematical - problems - reasoning - theorem | 100 | 13_math_mathematical_problems_reasoning
14 | vulnerability - code - vulnerabilities - security - log | 91 | 14_vulnerability_code_vulnerabilities_security
15 | forecasting - climate - time series - data - carbon | 78 | 15_forecasting_climate_time series_data
16 | style - poetry - style transfer - transfer - poems | 78 | 16_style_poetry_style transfer_transfer
17 | regression - matrix - softmax - mathbbrn - bf | 37 | 17_regression_matrix_softmax_mathbbrn
18 | recipes - recipe - food - cooking - dietary | 14 | 18_recipes_recipe_food_cooking
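The frequencies in the table can be turned into per-topic shares of the training set. The sketch below copies the counts from the table and assumes nothing about BERTopic itself. The names `TOPIC_COUNTS`, `topic_shares` and `parse_label` are illustrative only.

```python
# Topic frequencies copied from the table above (topic ID -> document count).
TOPIC_COUNTS = {
    -1: 11, 0: 3795, 1: 3252, 2: 1236, 3: 724, 4: 610, 5: 451, 6: 437,
    7: 377, 8: 274, 9: 201, 10: 164, 11: 113, 12: 103, 13: 100, 14: 91,
    15: 78, 16: 78, 17: 37, 18: 14,
}


def topic_shares(counts):
    """Map each topic ID to its fraction of all documents."""
    total = sum(counts.values())
    return {topic: n / total for topic, n in counts.items()}


def parse_label(label):
    """Split a label such as '2_visual_image_models_multimodal' into (ID, keywords)."""
    topic_id, *keywords = label.split("_")
    return int(topic_id), keywords


total_docs = sum(TOPIC_COUNTS.values())
shares = topic_shares(TOPIC_COUNTS)

# Print the three largest topics by share of documents.
for topic, share in sorted(shares.items(), key=lambda kv: -kv[1])[:3]:
    print(f"topic {topic}: {share:.1%}")
```

The counts sum to the 12146 training documents reported above. Topics 0 and 1 together account for roughly 58% of them, while the outlier topic -1 holds only 11 documents.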

Training hyperparameters

  • calculate_probabilities: False
  • language: None
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: 20
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: True
  • zeroshot_min_similarity: 0.7
  • zeroshot_topic_list: None
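As a rough illustration, the listed values map onto keyword arguments of the `BERTopic` constructor. This is a sketch under assumptions, not the author's training script: the card does not state which embedding, dimensionality reduction or clustering models were used, and `language: None` typically indicates that a custom embedding model was supplied, so the hypothetical `build_topic_model` helper takes one as an argument. The zero-shot arguments assume a BERTopic release that supports them.

```python
# Hyperparameters as listed above. All other settings are left at BERTopic
# defaults, which is an assumption because the card does not document them.
HYPERPARAMETERS = {
    "calculate_probabilities": False,
    "language": None,
    "low_memory": False,
    "min_topic_size": 10,
    "n_gram_range": (1, 1),
    "nr_topics": 20,
    "seed_topic_list": None,
    "top_n_words": 10,
    "verbose": True,
    "zeroshot_min_similarity": 0.7,
    "zeroshot_topic_list": None,
}


def build_topic_model(embedding_model=None, **overrides):
    """Create an unfitted BERTopic model with the listed settings (hypothetical helper)."""
    # Imported lazily so the settings dict can be inspected without bertopic installed.
    from bertopic import BERTopic
    params = {**HYPERPARAMETERS, **overrides}
    return BERTopic(embedding_model=embedding_model, **params)
```

A model built this way still has to be fitted with `fit_transform` on a document collection, and it will not reproduce the topics above unless the same documents and sub-models are used.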

Framework versions

  • Numpy: 1.25.2
  • HDBSCAN: 0.8.33
  • UMAP: 0.5.5
  • Pandas: 1.5.3
  • Scikit-Learn: 1.2.2
  • Sentence-transformers: 2.5.1
  • Transformers: 4.38.2
  • Numba: 0.58.1
  • Plotly: 5.15.0
  • Python: 3.10.12
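Loading a saved model can be sensitive to library versions, so it may help to compare the local environment against the list above. The sketch below uses only the standard library. The distribution names are the usual PyPI names for these packages, and the helper names are illustrative. The Python version itself is not checked.

```python
from importlib import metadata

# Versions listed above, keyed by PyPI distribution name.
EXPECTED_VERSIONS = {
    "numpy": "1.25.2",
    "hdbscan": "0.8.33",
    "umap-learn": "0.5.5",
    "pandas": "1.5.3",
    "scikit-learn": "1.2.2",
    "sentence-transformers": "2.5.1",
    "transformers": "4.38.2",
    "numba": "0.58.1",
    "plotly": "5.15.0",
}


def installed_version(dist_name):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def version_mismatches(expected):
    """Map each package whose installed version differs to (expected, installed)."""
    mismatches = {}
    for name, wanted in expected.items():
        found = installed_version(name)
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches
```

Calling `version_mismatches(EXPECTED_VERSIONS)` returns an empty dict when the environment matches the list exactly. A mismatch does not necessarily mean the model will fail to load.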