Shayne Longpre

Shayne

AI & ML interests

ML, NLP, Multilinguality, responsible/fair use of intelligent algorithms

Organizations

BigScience Catalogue Data; BigScience WG for evaluation of bias, fairness, and social impact; 🤗 H4 Community; Data Provenance Initiative; MIT Team for Data Provenance Analysis

Shayne's activity

reacted to yjernite's post with 🤗❤️ 11 months ago
👷🏽‍♀️📚🔨 Announcing the Foundation Model Development Cheatsheet!

My first 🤗Post🤗 ever, to announce the release of a fantastic collaborative resource to support model developers across the full development stack: the FM Development Cheatsheet, available here: https://fmcheatsheet.org/

The cheatsheet is a growing database of the many crucial resources coming from open research and development efforts to support the responsible development of models. This new resource highlights essential yet often underutilized tools in order to make it as easy as possible for developers to adopt best practices, covering among other aspects:
🧑🏼‍🤝‍🧑🏼 data selection, curation, and governance;
📖 accurate and limitations-aware documentation;
⚡ energy efficiency throughout the training phase;
📊 thorough capability assessments and risk evaluations;
🌏 environmentally and socially conscious deployment strategies.

We strongly encourage developers working on creating and improving models to make full use of the tools listed here, and to help keep the resource up to date by adding the resources that you yourself have developed or found useful in your own practice 🤗

Congrats to all the participants in this effort for the release! Read more about it from:
@Shayne - https://twitter.com/ShayneRedford/status/1763215814860186005
@hails and @stellaathena - https://blog.eleuther.ai/fm-dev-cheatsheet/
@alon-albalak - http://nlp.cs.ucsb.edu/blog/a-new-guide-for-the-responsible-development-of-foundation-models.html

And also to @gabrielilharco @sayashk @kklyman @kylel @mbrauh @fauxneticien @avi-skowron @Bertievidgen Laura Weidinger, Arvind Narayanan, @VictorSanh @Davlan @percyliang Rishi Bommasani, @breakend @sasha 🔥
reacted to VictorSanh's post with 👍 11 months ago
An increasing number of engineers and researchers are developing foundation models. Navigating the tools, resources, codebases, and best-practice guides is daunting for new contributors.

Introducing the Foundation Model Development Cheatsheet, a succinct guide with 250+ resources & tools for:
📖 sourcing data
🔍 documentation & audits
🌍 environmental impact
🥊 risks & harms eval
🎮 release & monitoring

https://fmcheatsheet.org/

👐 What tools & resources should appear in that cheatsheet? Contributions encouraged!

This is the result of a large collaboration between many organizations promoting open science, spearheaded by @Shayne 🔥
New activity in huggingface/text-data-filtering over 1 year ago

Code link broken in demo

#4 opened over 1 year ago by Shayne
New activity in SirNeural/flan_v2 almost 2 years ago

Updates to Flan V2 repo

10
#8 opened almost 2 years ago by Shayne
New activity in google/flan-t5-xl almost 2 years ago

Non-English languages not working

5
#6 opened about 2 years ago by diwank
New activity in google/flan-t5-xxl almost 2 years ago