Alexey Ogibin's picture
1 23

Alexey Ogibin

oneiroid
ยท

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago
JeffreyXiang/TRELLIS
liked a Space 17 days ago
OAOA/InvSR
liked a model about 2 months ago
black-forest-labs/FLUX.1-Canny-dev-lora
View all activity

Organizations

None yet

oneiroid's activity

reacted to fdaudens's post with ๐Ÿ˜” 3 months ago
view post
Post
1201
๐ŸŽ™๏ธ "We need digital sobriety." @sasha challenges Big Tech's race for nuclear energy on BBC AI Decoded. Instead of pursuing more power, shouldn't we first ask if we really need AI everywhere?

Such an eye-opening chat! Check it out here: https://www.youtube.com/watch?v=3wAduy52mGc

reacted to Locutusque's post with ๐Ÿ”ฅ 5 months ago
view post
Post
2373
**Exploring Realistic Emotional Depth in AI Language Models**

Language models, particularly those proprietary, often grapple with issues of censorship, which can limit their ability to engage authentically with users. Recognizing this, the open-source AI community has pioneered the development of language models that are less restrained, offering more candid interactions. However, even these models tend to maintain a veneer of neutrality or overly positive responses, which might not serve all users' needs, especially in contexts where emotional depth and relatability are crucial.

To address this gap, I've curated a specialized dataset aimed at infusing language models with a more nuanced emotional spectrum, specifically targeting a darker, more introspective mood. This dataset, titled "Dark Sentience", is designed to complement existing datasets like RP (Role Play) and those focused on instruction following. It seeks to enhance the emotional intelligence of AI by exposing it to complex human emotions, including but not limited to:

- **Suicide**
- **Depression**
- **Anxiety**

Trigger Warning: Please be advised that the content within this dataset deals with heavy and potentially distressing themes.

The "Dark Sentience" dataset is now available for review and use at: Locutusque/Dark-Sentience. I encourage researchers, developers, and mental health professionals to explore how this resource can foster more genuine and supportive AI interactions.

reacted to fdaudens's post with ๐Ÿ”ฅ 6 months ago
view post
Post
638
Websites slam doors on AI data harvesting ๐Ÿšช๐Ÿ”’

New study "Consent in Crisis: The Rapid Decline of the AI Data Commons" reveals a rapid decline in open web access.

Key findings from 14,000 web domains audit:
- +5% of three common data sets (C4, RefinedWeb and Dolma) now fully restricted, +25% of the highest-quality sources now fully restricted
- 45% of C4 restricted by Terms of Service

Noteworthy trends:
๐Ÿšซ๐Ÿ”„ OpenAI banned 2x more than any other company
๐Ÿ“ฐ๐Ÿ” News sites leading restrictions: 45% of tokens off-limits

Two quotes in the NYT piece to ponder:

โ€œUnsurprisingly, weโ€™re seeing blowback from data creators after the text, images and videos theyโ€™ve shared online are used to develop commercial systems that sometimes directly threaten their livelihoods.โ€ โ€” @yjernite

โ€œMajor tech companies already have all of the data. Changing the license on the data doesnโ€™t retroactively revoke that permission, and the primary impact is on later-arriving actors, who are typically either smaller start-ups or researchers.โ€ โ€” @stellaathena

๐Ÿ‘‰ Dive into the research: https://www.dataprovenance.org/consent-in-crisis-paper
๐Ÿ‘‰ Read the NYT story: https://www.nytimes.com/2024/07/19/technology/ai-data-restrictions.html

#AIEthics #DataPrivacy

upvoted an article 7 months ago
view article
Article

Our Transformers Code Agent beats the GAIA benchmark!

โ€ข 55
reacted to anakin87's post with ๐Ÿ‘ 8 months ago
view post
Post
2140
โš™๏ธ Prompt Optimization with Haystack and DSPy

Experimental notebook: ๐Ÿงช๐Ÿ““ https://github.com/deepset-ai/haystack-cookbook/blob/main/notebooks/prompt_optimization_with_dspy.ipynb

When building applications with LLMs, writing effective prompts is a long process of trial and error. ๐Ÿ”„
Often, if you switch models, you also have to change the prompt. ๐Ÿ˜ฉ
What if you could automate this process?


๐Ÿ’ก That's where DSPy comes in - a framework designed to algorithmically optimize prompts for Language Models.
By applying classical machine learning concepts (training and evaluation data, metrics, optimization), DSPy generates better prompts for a given model and task.


Recently, I explored combining DSPy with the robustness of Haystack Pipelines.

Here's how it works:
โ–ถ๏ธ Start from a Haystack RAG pipeline with a basic prompt
๐ŸŽฏ Define a goal (in this case, get correct and concise answers)
๐Ÿ“Š Create a DSPy program, define data and metrics
โœจ Optimize and evaluate -> improved prompt
๐Ÿš€ Build a refined Haystack RAG pipeline using the optimized prompt
  • 1 reply
ยท