ayjays132
/

QNetworkGPT2Large

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ayjays132 commited on Jan 3, 2024

Commit

2b6dd0b

·

1 Parent(s): d241d7b

Update README.md

Files changed (1) hide show

README.md +38 -29

README.md CHANGED Viewed

@@ -1,31 +1,44 @@
 ---
-model_type: gpt2
-architectures:
-- gpt2
-model_filename: pytorch_model.bin
-hidden_size: 2048
-num_hidden_layers: 12
-num_attention_heads: 12
-intermediate_size: 3072
-hidden_dropout_prob: 0.1
-attention_probs_dropout_prob: 0.1
-max_position_embeddings: 1024
-type_vocab_size: 1
-initializer_range: 0.02
-layer_norm_eps: 0.00001
-vocab_size: 50257
-license: apache-2.0
-datasets:
-- vicgalle/alpaca-gpt4
-language:
-- en
-metrics:
-- accuracy
-- bleu
-library_name: transformers
-pipeline_tag: text-generation
 ---
 ---
 ## Hyperameters used
@@ -56,10 +69,6 @@ Certainly! Here's a consolidated list of hyperparameters for your QNetworkGPT2 R
 Researchers can use these hyperparameters to configure and train their QNetworkGPT2 RL models effectively for text generation tasks.
 ---
-# QNetworkGPT2: Reinventing Text Generation with AI 📝🤖
-![Text Generation](https://static.vecteezy.com/system/resources/previews/023/477/674/non_2x/ai-generative-blue-red-ink-splash-illustration-free-png.png)
 ## Overview
 QNetworkGPT2 is an extraordinary AI model that marries Reinforcement Learning (RL) with the power of the GPT-2 language model to create impressive text generation experiences. 🚀

 ---
+{
+  "activation_function": "gelu_new",
+  "architectures": ["GPT2LMHeadModel"],
+  "model_filename": "pytorch_model.bin",
+  "attn_pdrop": 0.1,
+  "bos_token_id": 50256,
+  "embd_pdrop": 0.1,
+  "eos_token_id": 50256,
+  "initializer_range": 0.02,
+  "layer_norm_epsilon": 1e-05,
+  "model_type": "gpt2",
+  "n_ctx": 1024,
+  "n_embd": 512,
+  "n_head": 12,
+  "n_layer": 12,
+  "n_positions": 1024,
+  "resid_pdrop": 0.1,
+  "summary_activation": null,
+  "summary_first_dropout": 0.1,
+  "summary_proj_to_labels": true,
+  "summary_type": "cls_index",
+  "summary_use_proj": true,
+  "task_specific_params": {
+    "text-generation": {
+      "do_sample": true,
+      "max_length": 50
+    }
+  },
+  "vocab_size": 50257,
+  "hidden_dim": 2048,
+  "input_dim": 512,
+  "output_dim": 512
+}
 ---
+# QNetworkGPT2: Reinventing Text Generation with AI 📝🤖
+![Text Generation](https://static.vecteezy.com/system/resources/previews/023/477/674/non_2x/ai-generative-blue-red-ink-splash-illustration-free-png.png)
 ---
 ## Hyperameters used
 Researchers can use these hyperparameters to configure and train their QNetworkGPT2 RL models effectively for text generation tasks.
 ---
 ## Overview
 QNetworkGPT2 is an extraordinary AI model that marries Reinforcement Learning (RL) with the power of the GPT-2 language model to create impressive text generation experiences. 🚀