---
tags:
- bertopic
library_name: bertopic
pipeline_tag: text-classification
---

# C1-topic-model-100

This is a [BERTopic](https://github.com/MaartenGr/BERTopic) model.
BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

## Usage

To use this model, please install BERTopic:

```
pip install -U bertopic
```

You can use the model as follows:

```python
from bertopic import BERTopic
topic_model = BERTopic.load("AlexanderHolmes0/C1-topic-model-100")

topic_model.get_topic_info()
```

An example of the ChatGPT (GPT-3.5 Turbo) representations:

!["multiaspect.png"](Reduced_datamapplot-C1-Topics-no-logo.png)

## Topic overview

* Number of topics: 100
* Number of training documents: 112332

Overview of all topics:

| Topic | Representation | Count | Name |
|------:|:---------------|------:|:-----|
| -1 | ['stop', 'people', 'need', 'help', 'banking', 'like', 'work', 'business', 'money', 'make'] | 4738 | -1_stop_people_need_help |
| 0 | ['account', 'card', 'customer', 'service', 'phone', 'credit', 've', 'payment', 'help', 'number'] | 21155 | 0_account_card_customer_service |
| 1 | ['game', 'thank', 'win', 'team', 'bazinga_sb', 'thanks', 'play', 'lounge', 'congrats', 'hype'] | 11918 | 1_game_thank_win_team |
| 2 | ['app', 'need', 'stop', 'help', 'lol', 'shit', 'ok', 'like', 'cares', 'guys'] | 11946 | 2_app_need_stop_help |
| 3 | ['card', 'credit', 'account', 'customer', 'told', 'company', 'payment', 'called', 'service', 'capitol'] | 6152 | 3_card_credit_account_customer |
| 4 | ['friend', 'request', 'friends', 'send', 'mind', 'profile', 'facebook', 'sending', 'don', 'hello'] | 3403 | 4_friend_request_friends_send |
| 5 | ['community', 'event', 'lounge', 'thank', 'presented', 'program', 'food', 'business', 'join', 'suppo'] | 4398 | 5_community_event_lounge_thank |
| 6 | ['trading', 'whatsapp', 'forex', 'investment', 'profit', 'trade', 'invest', 'contact', 'crypto', 'trader'] | 2804 | 6_trading_whatsapp_forex_investment |
| 7 | ['', '', '', '', '', '', '', '', '', ''] | 1191 | 7____ |
| 8 | ['ho', 'genial', 'travolta', 'john', 'love', 'shimmmerrrr', 'lovettt', 'nnniiiccceeeee', 'vjjbnm', 'listos'] | 1157 | 8_ho_genial_travolta_john |
| 9 | ['santa', 'john', 'travolta', 'love', 'movie', 'christmas', 'ho', 'moves', 'claus', 'sexy'] | 1695 | 9_santa_john_travolta_love |
| 10 | ['que', 'en', 'la', 'para', 'el', 'es', 'una', 'por', 'los', 'mi'] | 1532 | 10_que_en_la_para |
| 11 | ['worst', 'service', 'company', 'customer', 'bank', 'horrible', 'terrible', 'card', 'account', 'don'] | 2417 | 11_worst_service_company_customer |
| 12 | ['love', 'awesome', 'happy', 'best', 'great', 'brilliant', 'wow', 'bihday', 'cool', 'amen'] | 2736 | 12_love_awesome_happy_best |
| 13 | ['people', 'blessed', 'grands', 'cus', 'paying', 'stay', 'hard', 'god', 'struggling', 'message'] | 896 | 13_people_blessed_grands_cus |
| 14 | ['later', 'connect', 'definitely', 'inbox', 'hello', 'thank', 'mr', 'message', 'll', 'hi'] | 4157 | 14_later_connect_definitely_inbox |
| 15 | ['shimmerr', 'kt', 'txscapitalone', 'capit', 'word', 'givaway', 'bejeweled', 'shine', 'ts', 'girl'] | 535 | 15_shimmerr_kt_txscapitalone_capit |
| 16 | ['credit', 'card', 'score', 'limit', 'pay', 'balance', 'cards', 'increase', 'payment', 'personal'] | 5072 | 16_credit_card_score_limit |
| 17 | ['spell', 'dr', 'caster', 'ex', 'lover', 'marriage', 'husband', 'love', 'relationship', 'page'] | 672 | 17_spell_dr_caster_ex |
| 18 | ['love', 'business', 'thank', 'credit', 'best', 'card', 'bank', 'great', 'capitol', 'cards'] | 2133 | 18_love_business_thank_credit |
| 19 | ['links', 'mm', 'click', 'nutshell', 'join', 'corey', 'referral', 'unabridged', 'gelmtree', 'chainzz'] | 515 | 19_links_mm_click_nutshell |
| 20 | ['travel', 'points', 'venture', 'miles', 'flights', 'trip', 'flight', 'book', 'bonus', 'card'] | 1271 | 20_travel_points_venture_miles |
| 21 | ['slash', 'guitar', 'guitarist', 'riff', 'song', 'play', 'solo', 'guns', 'playing', 'roses'] | 630 | 21_slash_guitar_guitarist_riff |
| 22 | ['reachout', 'miller', 'david', 'comes', 'investing', 'financial', 'help', 'diane', 'diana', 'christine'] | 303 | 22_reachout_miller_david_comes |
| 23 | ['preston', 'mourning', 'underneath', 'commenting', 'kelly', 'clicking', 'loss', 'management', 'wife', 'travolta'] | 211 | 23_preston_mourning_underneath_commenting |
| 24 | ['tnt', 'qbs', 'reminded', 'tune', 'headed', 'vegas', 'tweet', 'june', 'time', 'banners'] | 222 | 24_tnt_qbs_reminded_tune |
| 25 | ['working', 'website', 'isn', 'expired', 'enter', 'app', 'haven', 'gotten', 'doesn', 'work'] | 947 | 25_working_website_isn_expired |
| 26 | ['suck', 'sucks', 'lol', 'fake', 'sounds', 'joke', 'trash', 'stupid', 'bad', 'shit'] | 1603 | 26_suck_sucks_lol_fake |
| 27 | ['love', 'cute', 'absolutely', 'ittttt', 'lovelove', 'loveeeee', 'song', 'grandkids', 'awesome', 'dancing'] | 292 | 27_love_cute_absolutely_ittttt |
| 28 | ['mobile', 'app', 'digital', 'options', 'shopping', 'set', 'safely', 'card', 'banking', 'phone'] | 554 | 28_mobile_app_digital_options |
| 29 | ['cardigan', 'flannel', 'plaid', 'tsxccapitalone', 'cardigans', 'caedigan', 'taxcaptialone', 'ysxcapitalone', 'robe', 'tsxcapit'] | 191 | 29_cardigan_flannel_plaid_tsxccapitalone |
| 30 | ['presale', 'event', 'taylorswift', 'venue', 'np', 'cardholder', 'events', 'gain', 'staing', 'stas'] | 403 | 30_presale_event_taylorswift_venue |
| 31 | ['donna', 'pescow', 'annette', 'fever', 'looks', 'saturday', 'cameo', 'movie', 'night', 'actress'] | 340 | 31_donna_pescow_annette_fever |
| 32 | ['card', 'best', 'great', 'love', 'cards', 'got', 'credit', 'better', 'chase', 'good'] | 632 | 32_card_best_great_love |
| 33 | ['flannel', 'flannnel', 'scymbags', 'rsxcapitalone', 'pumpkinseason', 'xgiveaway', 'tsxca', 'fallvibes', 'tsxcpaitalone', 'cozyflannel'] | 172 | 33_flannel_flannnel_scymbags_rsxcapitalone |
| 34 | ['car', 'auto', 'navigator', 'cars', 'buying', 'vehicle', 'dealership', 'truck', 'pre', 'suv'] | 566 | 34_car_auto_navigator_cars |
| 35 | ['𝗍𝗁𝖺𝗍', '𝖺𝗇𝖽', '𝗈𝗎𝗋', '𝗒𝗈𝗎', '𝗍𝗈', '𝖺𝖼𝖼𝗈𝗎𝗇𝗍', '𝗒𝗈𝗎𝗋', '𝖼𝗈𝗉𝗒𝗋𝗂𝗀𝗁𝗍', '𝗍𝗁𝖾', '𝐲𝐨𝐮'] | 416 | 35_𝗍𝗁𝖺𝗍_𝖺𝗇𝖽_𝗈𝗎𝗋_𝗒𝗈𝗎 |
| 36 | ['sure', 'true', 'correct', 'yessir', 'yea', 'yessssss', 'yess', 'yesss', 'duh', 'thank'] | 329 | 36_sure_true_correct_yessir |
| 37 | ['beard', 'chef', 'awards', 'beardfoundation', 'james', 'restaurant', 'presented', 'chefs', 'taste', 'jbfa'] | 223 | 37_beard_chef_awards_beardfoundation |
| 38 | ['worst', 'bank', 'service', 'customer', 'fucking', 'hate', 'banking', 'trash', 'company', 'worse'] | 1098 | 38_worst_bank_service_customer |
| 39 | ['bank', 'banks', 'banking', 'best', 'chairman', 'branches', 'chairperson', 'great', 'dg', 'atms'] | 397 | 39_bank_banks_banking_best |
| 40 | ['student', 'unlimited', 'key', 'quicksilver', 'cash', 'savorone', 'earning', 'earn', 'enteainment', 'presents'] | 343 | 40_student_unlimited_key_quicksilver |
| 41 | ['teambradgers', 'hardwood', 'winning', 'head', 'winners', 'congratulations', 'luck', 'came', 'close', 'better'] | 178 | 41_teambradgers_hardwood_winning_head |
| 42 | ['giveway', 'tstheerastour', 'tsxcapitalone', 'givesway', 'givaway', 'leopard', 'tscott', 'giveaway', 'tsxcapital', 'ts'] | 155 | 42_giveway_tstheerastour_tsxcapitalone_givesway |
| 43 | ['love', 'awesome', 'cute', 'omg', 'absolutely', 'cool', 'oh', 'niiice', 'loooovvve', 'fantastic'] | 386 | 43_love_awesome_cute_omg |
| 44 | ['postdoctoral', 'και', '𝐭𝐨', '𝐚𝐧𝐝', 'رسول', '𝗒𝗈𝗎𝗋', 'να', 'mr', '𝗍𝗁𝖾', 'ㅤㅤ'] | 357 | 44_postdoctoral_και_𝐭𝐨_𝐚𝐧𝐝 |
| 45 | ['platinum', 'excluded', 'cards', 'credit', 'secured', 'card', 'mastercard', 'creditstacks', 'crédito', 'deposit'] | 127 | 45_platinum_excluded_cards_credit |
| 46 | ['solving', 'ai', 'data', 'methods', 'learning', 'oracle', 'systems', 'algebraic', 'etem', 'machine'] | 219 | 46_solving_ai_data_methods |
| 47 | ['dm', 'sent', 'haven', 'gotten', 'dms', 'check', 'received', 'response', 'answer', 'reply'] | 289 | 47_dm_sent_haven_gotten |
| 48 | ['nice', 'cool', 'good', 'cute', 'dimples', 'pretty', 'funny', 'lovely', 'hello', 'hey'] | 206 | 48_nice_cool_good_cute |
| 49 | ['cafés', 'branches', 'offices', 'closed', 'holiday', 'app', 'recognition', 'celebration', 'january', 'observance'] | 130 | 49_cafés_branches_offices_closed |
| 50 | ['gt', 'page', 'javier', 'facebook', 'sergio', 'boutique', 'account', 'salvation', 'chalk', 'steps'] | 191 | 50_gt_page_javier_facebook |
| 51 | ['fault', 'mailbox', 'buyer', 'flood', 'vendors', 'loves', 'costs', 'fyi', 'shady', 'lines'] | 89 | 51_fault_mailbox_buyer_flood |
| 52 | ['responsibility', 'expect', 'accept', 'unethical', 'perjury', 'business', 'involved', 'guilty', 'abusing', 'purposes'] | 88 | 52_responsibility_expect_accept_unethical |
| 53 | ['holiday', 'encore', 'noël', 'livestream', 'songs', 'album', 'spirit', 'performance', 'featuring', 'season'] | 44 | 53_holiday_encore_noël_livestream |
| 54 | ['bio', 'webinar', 'mcws', 'capitalonecafe', 'wcwsselfie', 'program', 'unprecedented', 'predictable', 'brand', 'capitalize'] | 565 | 54_bio_webinar_mcws_capitalonecafe |
| 55 | ['agree', 'true', 'haha', 'ha', 'right', 'mendez', 'dianna', 'sure', 'yea', 'exactly'] | 324 | 55_agree_true_haha_ha |
| 56 | ['word', 'code', 'shib', 'enter', 'biiiish', 'shimma', 'shimmerr', 'txscapitalone', 'gotta', 'hashtags'] | 113 | 56_word_code_shib_enter |
| 57 | ['uber', 'eats', 'complimentary', 'orders', 'nov', 'membership', 'eligible', 'unlimited', 'terms', 'cardholders'] | 46 | 57_uber_eats_complimentary_orders |
| 58 | ['bbb', 'tracker', 'scams', 'scam', 'scamtracker', 'bbbscamtracker', 'org', 'amazon', 'detect', 'protect'] | 85 | 58_bbb_tracker_scams_scam |
| 59 | ['count', 'days', 'fucking', 'bitch', 'fuckin', 'counting', 'yall', 'ups', 'ur', 'numbered'] | 89 | 59_count_days_fucking_bitch |
| 60 | ['tradelines', 'cpn', 'repair', 'removal', 'inquiries', 'sba', 'score', 'loans', 'boosting', 'credit'] | 102 | 60_tradelines_cpn_repair_removal |
| 61 | ['gas', 'prices', 'inflation', 'cars', 'electric', 'gasoline', 'falling', 'climbing', 'electricity', 'economy'] | 159 | 61_gas_prices_inflation_cars |
| 62 | ['spark', 'antonelli', 'preset', 'cash', 'cheese', 'unlimited', 'helps', 'shop', 'spending', 'grow'] | 57 | 62_spark_antonelli_preset_cash |
| 63 | ['waiting', 'room', 'queue', 'open', 'lounge', 'wait', 'denver', 'elevator', 'does', 'presale'] | 455 | 63_waiting_room_queue_open |
| 64 | ['need', 'want', 'got', 'officce', 'wish', 'rhode', 'interested', 'waiting', 'waverly', 'needs'] | 371 | 64_need_want_got_officce |
| 65 | ['taher', 'bapary', 'abu', 'wow', 'baah', 'testyes', 'relaxday', 'ernest', 'videos', 'flower'] | 46 | 65_taher_bapary_abu_wow |
| 66 | ['atos', 'story', 'cloud', 'datacenter', 'upl', 'transformative', 'wp', 'laid', 'personalized', 'content'] | 77 | 66_atos_story_cloud_datacenter |
| 67 | ['song', 'itunes', 'radio', 'rainy', 'bts', 'cha', 'ranked', 'uk', 'play', 'hear'] | 163 | 67_song_itunes_radio_rainy |
| 68 | ['dining', 'cardholders', 'reservations', 'restaurants', 'rated', 'curated', 'reservation', 'culinary', 'rewards', 'sought'] | 69 | 68_dining_cardholders_reservations_restaurants |
| 69 | ['grant', 'apply', 'upfront', 'federal', 'government', 'grants', 'retired', 'program', 'public', 'assistance'] | 103 | 69_grant_apply_upfront_federal |
| 70 | ['scarf', 'tsgiveaway', 'towel', 'tote', 'bag', 'tstheerastour', 'red', 'beach', 'kept', 'old'] | 53 | 70_scarf_tsgiveaway_towel_tote |
| 71 | ['annual', 'fee', 'fees', 'loffland', 'tam', 'yearly', 'spend', 'year', 'charged', 'pay'] | 204 | 71_annual_fee_fees_loffland |
| 72 | ['unsubcribe', 'mailing', 'stop', 'remove', 'list', 'emails', 'gk', 'removed', 'email', 'unsubscribed'] | 106 | 72_unsubcribe_mailing_stop_remove |
| 73 | ['crooks', 'money', 'scumbags', 'stole', 'thieves', 'steal', 'fraud', 'counts', 'people', 'scam'] | 633 | 73_crooks_money_scumbags_stole |
| 74 | ['mcws', 'rebs', 'omaha', 'wps', 'toddy', 'hotty', 'game', 'olemaha', 'hottytoddy', 'hogs'] | 105 | 74_mcws_rebs_omaha_wps |
| 75 | ['bejeweled', 'ok', 'hi', 'mm', 'tay', 'hola', 'epic', 'ally', 'maria', 'gay'] | 719 | 75_bejeweled_ok_hi_mm |
| 76 | ['pov', 'misclicked', 'clicked', 'clicky', 'pa', 'screen', 'misclick', 'missed', 'did', 'click'] | 174 | 76_pov_misclicked_clicked_clicky |
| 77 | ['financial', 'tips', 'budget', 'empowerment', 'healthy', 'goals', 'literacy', 'money', 'check', 'save'] | 379 | 77_financial_tips_budget_empowerment |
| 78 | ['wynn', 'match', 'brady', 'hole', 'las', 'tom', 'returns', 'rodgers', 'patrick', 'vegas'] | 70 | 78_wynn_match_brady_hole |
| 79 | ['upgrade', 'upgrades', 'upgraded', 'update', 'seats', 'seat', 'app', 'increase', 'upgradeeee', 'darkmode'] | 142 | 79_upgrade_upgrades_upgraded_update |
| 80 | ['step', 'tapn', 'wit', 'walking', 'method', 'helped', 'tapp', 'got', 'paid', 'split'] | 120 | 80_step_tapn_wit_walking |
| 81 | ['facilitate', 'form', 'cybersuppospy', 'communicate', 'following', 'expedite', 'channel', 'dispute', 'suppo', 'promptly'] | 46 | 81_facilitate_form_cybersuppospy_communicate |
| 82 | ['debt', 'credit', 'need', 'card', 'dm', 'pay', 'debts', 'help', 'bills', 'cards'] | 375 | 82_debt_credit_need_card |
| 83 | ['press', 'interactive', 'seat', 'conference', 'hot', 'et', 'qbs', 'questions', 'reply', 'answer'] | 31 | 83_press_interactive_seat_conference |
| 84 | ['minority', 'jackie', 'scholarship', 'nation', 'donating', 'premier', 'programs', 'students', 'leadership', 'walk'] | 39 | 84_minority_jackie_scholarship_nation |
| 85 | ['chefs', 'chef', 'exclusive', 'dining', 'cardholders', 'recognized', 'viue', 'erick', 'hyde', 'restaurants'] | 30 | 85_chefs_chef_exclusive_dining |
| 86 | ['flannel', 'cardigan', 'scarf', 'jk', 'code', 'lol', 'word', 'tis', 'oops', 'seamstr'] | 91 | 86_flannel_cardigan_scarf_jk |
| 87 | ['sell', 'sold', 'sellout', 'selling', 'tim', 'marketing', 'piece', 'wage', 'saying', 'genius'] | 90 | 87_sell_sold_sellout_selling |
| 88 | ['movie', 'saw', 'watch', 'watching', 'video', 'tv', 'watched', 'love', 'night', 'seen'] | 411 | 88_movie_saw_watch_watching |
| 89 | ['denied', 'approved', 'applied', 'pre', 'tried', 'qualify', 'won', 'didn', 'let', 'apply'] | 188 | 89_denied_approved_applied_pre |
| 90 | ['balali', 'dizabali', 'manager', 'blame', 'changing', 'bought', 'complete', 'withdrawal', 'oppounity', 'invest'] | 24 | 90_balali_dizabali_manager_blame |
| 91 | ['office', 'direct', 'contact', 'don', 'erica', 'renan', 'lizarralde', 'vallecilla', 'kirklnd', 'offices'] | 60 | 91_office_direct_contact_don |
| 92 | ['stimulus', 'latest', 'mon', 'dates', 'track', 'round', 'government', 'federal', 'grow', 'transferred'] | 23 | 92_stimulus_latest_mon_dates |
| 93 | ['itbot', 'cash', 'gyvi', 'creditcard', 'best', 'wallethub', 'credit', 'visa', 'mastercard', 'couple'] | 199 | 93_itbot_cash_gyvi_creditcard |
| 94 | ['pakistan', 'donate', 'relief', 'flood', 'activities', 'purchase', 'islamabad', 'belgium', 'rawalpindi', 'ptt'] | 27 | 94_pakistan_donate_relief_flood |
| 95 | ['lottery', 'winner', 'jackpot', 'fan', 'million', 'powerball', 'sum', 'funds', 'dollars', 'lucky'] | 77 | 95_lottery_winner_jackpot_fan |
| 96 | ['payments', 'sending', 'email', 'payment', 'suppo', 'mail', 'stop', 'mailed', 'check', 'days'] | 127 | 96_payments_sending_email_payment |
| 97 | ['cardholders', 'star', 'mlb', 'ultimate', 'allstarweek', 'giveaways', 'performances', 'official', 'gate', 'highlights'] | 57 | 97_cardholders_star_mlb_ultimate |
| 98 | ['reds', 'disparage', 'toss', 'stats', 'count', 'overrated', 'weight', 'pure', 'vccs', 'acquaintd'] | 84 | 98_reds_disparage_toss_stats |
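
The table can be read with two small helpers, sketched here in plain Python. The naming pattern (topic id followed by the first four representation words, joined with underscores) is inferred from the rows above rather than taken from BERTopic's documentation, and the function names `topic_name` and `share_of_corpus` are hypothetical. Topic -1 is BERTopic's label for outlier documents.

```python
# Values copied by hand from this card: the total comes from "Topic overview",
# the counts and keyword lists come from the table.
TOTAL_DOCUMENTS = 112332


def topic_name(topic_id, words, n_words=4):
    """Join the topic id and the first `n_words` keywords with underscores.

    This mirrors the pattern visible in the Name column; it is an inference
    from the table, not a documented guarantee.
    """
    return "_".join([str(topic_id), *words[:n_words]])


def share_of_corpus(count, total=TOTAL_DOCUMENTS):
    """Return the percentage of training documents assigned to a topic."""
    return 100 * count / total


print(topic_name(0, ["account", "card", "customer", "service", "phone"]))
print(topic_name(7, ["", "", "", ""]))  # empty keywords explain the "7____" name
print(f"{share_of_corpus(4738):.1f}% of documents fall in topic -1")
print(f"{share_of_corpus(21155):.1f}% of documents fall in topic 0")
```

Under these assumptions, the empty keyword list of topic 7 is what produces its unusual name, and the two shares give a quick sense of how much of the corpus sits in the outlier topic versus the largest topic.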

## Training hyperparameters

* calculate_probabilities: False
* language: None
* low_memory: False
* min_topic_size: 10
* n_gram_range: (1, 1)
* nr_topics: 100
* seed_topic_list: None
* top_n_words: 10
* verbose: True

## Framework versions

* Numpy: 1.26.4
* HDBSCAN: 0.8.33
* UMAP: 0.5.5
* Pandas: 2.0.3
* Scikit-Learn: 1.4.1.post1
* Sentence-transformers: 2.5.1
* Transformers: 4.40.0
* Numba: 0.59.1
* Plotly: 5.20.0
* Python: 3.11.8
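
Loading a saved model can be sensitive to library versions, so it may help to compare a local environment against the versions listed above. The following is a minimal sketch using only the standard library. The mapping from the display names in the list to PyPI distribution names (for example UMAP to `umap-learn`) is an assumption, and `compare_with_installed` is a hypothetical helper, not part of BERTopic.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions copied from the "Framework versions" list, keyed by the PyPI
# distribution name (assumed mapping, e.g. UMAP -> umap-learn).
LISTED_VERSIONS = {
    "numpy": "1.26.4",
    "hdbscan": "0.8.33",
    "umap-learn": "0.5.5",
    "pandas": "2.0.3",
    "scikit-learn": "1.4.1.post1",
    "sentence-transformers": "2.5.1",
    "transformers": "4.40.0",
    "numba": "0.59.1",
    "plotly": "5.20.0",
}


def compare_with_installed(listed=LISTED_VERSIONS):
    """Map each package to (listed, installed, matches).

    `installed` is None when the package is not present locally.
    """
    report = {}
    for name, wanted in listed.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            installed = None
        report[name] = (wanted, installed, installed == wanted)
    return report


for name, (wanted, installed, ok) in compare_with_installed().items():
    status = "ok" if ok else "differs"
    print(f"{name}: listed {wanted}, installed {installed} ({status})")
```

A mismatch does not necessarily mean the model will fail to load; the report is only a starting point when debugging load or inference errors.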