Telegram Group & Telegram Channel
2024-december-transformers.png
904.2 KB
tasty ai papers | december 2024

1️⃣ Byte Latent Transformer: Patches Scale Better Than Tokens

what: train llama on raw bytes without a fixed vocabulary.
- dynamically patches bytes usign local small encoder
- main decoder process these patch in AR setting
- local deocder makes next byte prediction.
paper: https://arxiv.org/abs/2412.09871

2️⃣ Large Concept Models: Language Modeling in a Sentence Representation Space

what: work with entire sentences as "concepts" through SONAR embeddings.
- quite similar with the first paper here, but it merges tokens into high dim embeddings
- working with sentence-level embeddings directly.

paper: https://arxiv.org/abs/2412.08821

3️⃣ GenCast predicts weather and the risks of extreme conditions with state-of-the-art accuracy

what: Created a diffusion model for probabilistic weather forecasting that generates 15-day predictions with 12-hour steps
how:
- It aggregates two previous timesteps to predict the next weather state
- Instead of directly sampling weather state, it generates residuals (differences) relative to the previous state.
- Артемий в канале AI для Всех сделал ревью на русском, почитайте.

paper: https://www.nature.com/articles/s41586-024-08252-9

my thoughts:
Looks like we're finally getting closer to how humans actually process language, not just crunching tokens like robots. Whether it's patching bytes or bundling tokens into sentence embeddings, this hierarchical approach seems to be the way forward.
GenCast - is just super interesting adoption of modern AI to real problems in natural science.
Please open Telegram to view this post
VIEW IN TELEGRAM



group-telegram.com/neural_cell/225
Create:
Last Update:

tasty ai papers | december 2024

1️⃣ Byte Latent Transformer: Patches Scale Better Than Tokens

what: train llama on raw bytes without a fixed vocabulary.
- dynamically patches bytes usign local small encoder
- main decoder process these patch in AR setting
- local deocder makes next byte prediction.
paper: https://arxiv.org/abs/2412.09871

2️⃣ Large Concept Models: Language Modeling in a Sentence Representation Space

what: work with entire sentences as "concepts" through SONAR embeddings.
- quite similar with the first paper here, but it merges tokens into high dim embeddings
- working with sentence-level embeddings directly.

paper: https://arxiv.org/abs/2412.08821

3️⃣ GenCast predicts weather and the risks of extreme conditions with state-of-the-art accuracy

what: Created a diffusion model for probabilistic weather forecasting that generates 15-day predictions with 12-hour steps
how:
- It aggregates two previous timesteps to predict the next weather state
- Instead of directly sampling weather state, it generates residuals (differences) relative to the previous state.
- Артемий в канале AI для Всех сделал ревью на русском, почитайте.

paper: https://www.nature.com/articles/s41586-024-08252-9

my thoughts:
Looks like we're finally getting closer to how humans actually process language, not just crunching tokens like robots. Whether it's patching bytes or bundling tokens into sentence embeddings, this hierarchical approach seems to be the way forward.
GenCast - is just super interesting adoption of modern AI to real problems in natural science.

BY the last neural cell


Warning: Undefined variable $i in /var/www/group-telegram/post.php on line 260

Share with your friend now:
group-telegram.com/neural_cell/225

View MORE
Open in Telegram


Telegram | DID YOU KNOW?

Date: |

In addition, Telegram's architecture limits the ability to slow the spread of false information: the lack of a central public feed, and the fact that comments are easily disabled in channels, reduce the space for public pushback. However, the perpetrators of such frauds are now adopting new methods and technologies to defraud the investors. "Your messages about the movement of the enemy through the official chatbot … bring new trophies every day," the government agency tweeted. "Markets were cheering this economic recovery and return to strong economic growth, but the cheers will turn to tears if the inflation outbreak pushes businesses and consumers to the brink of recession," he added. As the war in Ukraine rages, the messaging app Telegram has emerged as the go-to place for unfiltered live war updates for both Ukrainian refugees and increasingly isolated Russians alike.
from sg


Telegram the last neural cell
FROM American