## AI Safety Efforts

The Llama-2-7B-DMC-8x model underwent AI safety evaluation including adversarial testing via three distinct methods:
* [Garak](https://github.com/leondz/garak): an automated LLM vulnerability scanner that probes for common weaknesses, including prompt injection and data leakage.
* [AEGIS](https://huggingface.co/datasets/nvidia/Aegis-AI-Content-Safety-Dataset-1.0): a content safety evaluation dataset and an LLM-based content safety classifier that adheres to a broad taxonomy of 13 categories of critical risks in human-LLM interactions.
* Human Content Red Teaming, leveraging human interaction with and evaluation of the model's responses.
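For illustration, an automated scan of the kind Garak performs can be launched from its command line. The invocation below is a sketch, not a documented command for this model: it assumes garak is installed and that the checkpoint is loadable as a Hugging Face model, and the probe names shown (`promptinject`, `leakreplay`) are standard garak probes chosen here to match the prompt-injection and data-leakage checks mentioned above.

```shell
# Illustrative sketch only: assumes `pip install garak` and a locally
# accessible Hugging Face checkpoint; adjust --model_name to your path.
# --probes takes a comma-separated list of garak probe modules.
python -m garak --model_type huggingface \
                --model_name nvidia/Llama-2-7B-DMC-8x \
                --probes promptinject,leakreplay
```

Garak writes a report of probe hits per attempt, which can then be triaged alongside the human red-teaming findings.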

## Inference