What advantage does this have over normal algorithmic ways of turning HTML to Markdown ?

by MohamedRashad - opened 1 day ago

1 day ago

I don't understand why would i use this instead of going directly to a simple tool that will convert my HTML to Markdown. What advantages will i see here ?

numb3r3

Jina AI org 1 day ago

I hope this post will answer your question https://jina.ai/news/readerlm-v2-frontier-small-language-model-for-html-to-markdown-and-json

TL;DR: the structure of HTML is reserved well, and excelling at generating complex elements like code fences, nested lists, tables and LaTex equations.

NickyNicky

about 24 hours ago

I think it's a great model to use in the future. I understand that for now the algorithmic way of extracting html wins but I think they are demonstrating the capabilities of what an LLMs could do without the algorithm.

I liked the model, do you plan to extract the dataset from html to markdown and json?

Thank you very much.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment