David Vivancos's picture

David Vivancos

DavidVivancos

AI & ML interests

Machine Learning, Deep Learning, NeuroTechnologies

Organizations

CVPR Demo Track's profile picture Open-Source AI Meetup's profile picture ICCV2023's profile picture ICML2023's profile picture Social Post Explorers's profile picture

DavidVivancos's activity

reacted to dylanebert's post with ๐Ÿ‘€ 4 months ago
view post
Post
2814
Keep track of the latest 3D releases with this space๐Ÿ‘‰ dylanebert/research-tracker
  • 2 replies
ยท
posted an update 6 months ago
view post
Post
1723
#ICLM 2024 is almost there ๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ PM if you will be in Vienna next week, Glad to catchup with the Hugging Face community there!

I would like to contribute ๐ŸŽ by releasing the sixth Knowledge Vault, with 100 lectures visualized from the last 10 years of ICML from 2014 to 2024, (10 from 2024 will be included after the conference) including knowledge graphs for all the Invited Lectures and some extras, with almost 3000 topics represented using AI.

You can explore it here:
๐ŸŒ https://theendofknowledge.com/Vaults/6/ICML-2015-2024.html

And you can learn more about the Vaults here:
๐Ÿ“https://www.linkedin.com/pulse/knowledge-vaults-david-vivancos-lbjef/

And previous Vaults relevant to the #huggingface community are:

๐ŸŒ [ @lexfridman 2018-2024 Interviews] https://theendofknowledge.com/Vaults/1/Lex100-2024.html

๐ŸŒ [ICLR 2014-2023] https://theendofknowledge.com/Vaults/2/ICLR2014-2023.html

๐ŸŒ [AIForGood 2017-2024] https://theendofknowledge.com/Vaults/4/AIForGood2017-2024.html

๐ŸŒ [CVPR 2015-2024] https://theendofknowledge.com/Vaults/5/CVPR-2015-2024.html

Hope you like them!

And great to see you all at #icml2024 @clem @thomwolf @julien-c and team
posted an update 9 months ago
view post
Post
1974
#ICLR 2024 is almost there ๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅ counting the days to be again in the beautiful city of Vienna participating in the The Twelfth International Conference on Learning Representations, hope to see many of the Hugging Face comunity there!

I would like to contribute ๐ŸŽ by releasing the second Knowledge Vault, with 100 lectures visualized from the last 10 years of ICLR from 2014 to 2023, including knowledge graphs for all the Invited Lectures and some extras, with almost 3000 topics represented. (Of course using several AI tools including Llama3)

You can explore it here:
๐ŸŒ https://theendofknowledge.com/Vaults/2/ICLR2014-2023.html

And you can learn more about the Vaults here:
๐Ÿ“https://www.linkedin.com/pulse/knowledge-vaults-david-vivancos-lbjef/

Hope you like the Knowledge Vault!
reacted to akhaliq's post with โค๏ธ 11 months ago
view post
Post
Design2Code

How Far Are We From Automating Front-End Engineering?

Design2Code: How Far Are We From Automating Front-End Engineering? (2403.03163)

Generative AI has made rapid advancements in recent years, achieving unprecedented capabilities in multimodal understanding and code generation. This can enable a new paradigm of front-end development, in which multimodal LLMs might directly convert visual designs into code implementations. In this work, we formalize this as a Design2Code task and conduct comprehensive benchmarking. Specifically, we manually curate a benchmark of 484 diverse real-world webpages as test cases and develop a set of automatic evaluation metrics to assess how well current multimodal LLMs can generate the code implementations that directly render into the given reference webpages, given the screenshots as input. We also complement automatic metrics with comprehensive human evaluations. We develop a suite of multimodal prompting methods and show their effectiveness on GPT-4V and Gemini Pro Vision. We further finetune an open-source Design2Code-18B model that successfully matches the performance of Gemini Pro Vision. Both human evaluation and automatic metrics show that GPT-4V performs the best on this task compared to other models. Moreover, annotators think GPT-4V generated webpages can replace the original reference webpages in 49% of cases in terms of visual appearance and content; and perhaps surprisingly, in 64% of cases GPT-4V generated webpages are considered better than the original reference webpages. Our fine-grained break-down metrics indicate that open-source models mostly lag in recalling visual elements from the input webpages and in generating correct layout designs, while aspects like text content and coloring can be drastically improved with proper finetuning.
ยท
replied to akhaliq's post 11 months ago
replied to their post 12 months ago
view reply

Hi @clefourrier will be, atm they are drawn directly from the authors reported data in their papers linkend in the leaderboard.

posted an update 12 months ago
view post
Post
Are you up to a ๐Ÿค— challenge ๐Ÿ†โ€‹ ?
if so ๐Ÿ‘€โ€‹ Check out the new MindBigData Leaderboard ๐Ÿ”ฅโ€‹๐Ÿ”ฅโ€‹๐Ÿ”ฅโ€‹
๐Ÿš€ DavidVivancos/MindBigData-Leaderboard

Decode the "source" ๐Ÿง โ€‹ with the largest multimodal opendata of brain signals for Machine Learning.

Try to beat the whopping ๐Ÿฅ‡โ€‹98,97% accuracy of Smita Tiwari Shivani Goel School of CSET Bennett University, India and Arpit Bhardwaj BML Munjal University decoding the multiclass Yann LeCun mnist of brain digits caputured with EMOTIV Epoc and ๐Ÿฅ‡โ€‹89,62% with Insight

Or the ๐Ÿฅ‡โ€‹96,18% of Dr. Nrushingh Charan Mahapatra Intel Corporation and Prof.(Dr). Prachet Bhuyan Kalinga Institute of Industrial Technology, Bhubaneswar also with the mnist of brain digits but captured with Museยฎ by Interaxon Inc.

Or the ๐Ÿฅ‡โ€‹85% of Matthew Zhang Westlake High School and Jeremy Lu now at Purdue University decoding brain images captured from imagenet

Or be the first to break the ๐ŸงŠ with the largest open dataset 2023 (8+ billion datapoints), the multimodal MindBigData2023_MNIST-8B captured with a custom 128 channels EEG that I built and with the real 70,000 MNIST digits and put your NVIDIA gpus to work.

All the datasets are open and ready at HuggingFace, dare to try?

Hope to see you all soon in the LeaderBoard

Thanks
@DavidVivancos
  • 2 replies
ยท