barun

adwit

AdwitBarun

AI & ML interests

machine learning

adwit's activity

upvoted 3 papers 7 months ago

Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

Paper • 2406.11139 • Published Jun 17, 2024 • 13

Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

Paper • 2406.11801 • Published Jun 17, 2024 • 16

SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

Paper • 2406.12274 • Published Jun 18, 2024 • 15