Telegram Group & Telegram Channel
πŸ”₯ Π Π΅Π»ΠΈΠ· Qwen 3 ΠΎΡ‚ Alibaba

Π’ Ρ€Π΅Π»ΠΈΠ· вошли 2 MoE-ΠΌΠΎΠ΄Π΅Π»ΠΈ ΠΈ 6 Dense models (ΠΏΠ»ΠΎΡ‚Π½Ρ‹Π΅ ΠΌΠΎΠ΄Π΅Π»ΠΈ), Ρ€Π°Π·ΠΌΠ΅Ρ€ΠΎΠΌ ΠΎΡ‚ 0.6B Π΄ΠΎ 235B ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².

πŸ† Ѐлагманская модСль Qwen3-235B-A22B дСмонстрируСт ΠΊΠΎΠ½ΠΊΡƒΡ€Π΅Π½Ρ‚Π½Ρ‹Π΅ Ρ€Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚Ρ‹ Π² Π·Π°Π΄Π°Ρ‡Π°Ρ… Кодина, ΠΌΠ°Ρ‚Π΅ΠΌΠ°Ρ‚ΠΈΠΊΠΈ ΠΈ ΠΎΠ±Ρ‰ΠΈΡ… способностСй, ΡƒΠ²Π΅Ρ€Π΅Π½Π½ΠΎ сопСрничая с ΠΏΠ΅Ρ€Π΅Π΄ΠΎΠ²Ρ‹ΠΌΠΈ модСлями, Ρ‚Π°ΠΊΠΈΠΌΠΈ ΠΊΠ°ΠΊ DeepSeek-R1, o1, o3-mini, Grok-3 ΠΈ Gemini-2.5-Pro.
⚑ НСбольшая MoE-модСль Qwen3-30B-A3B прСвосходит QwQ-32B,  ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡŽ Π² 10 Ρ€Π°Π· мСньшС ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².
πŸ”₯ ΠšΠΎΠΌΠΏΠ°ΠΊΡ‚Π½Π°Ρ модСль Qwen3-4B сопоставима ΠΏΠΎ ΠΏΡ€ΠΎΠΈΠ·Π²ΠΎΠ΄ΠΈΡ‚Π΅Π»ΡŒΠ½ΠΎΡΡ‚ΠΈ с Qwen2.5-72B-Instruct.
🧠 ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ Π³ΠΈΠ±Ρ€ΠΈΠ΄Π½Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ

Π Π΅ΠΆΠΈΠΌ Ρ€Π°Π·ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ активируСтся ΠΏΡ€ΠΈ ΠΎΠ±Ρ€Π°Π±ΠΎΡ‚ΠΊΠ΅ слоТных Π·Π°Π΄Π°Ρ‡, обСспСчивая ΠΏΠΎΡˆΠ°Π³ΠΎΠ²Ρ‹ΠΉ Π°Π½Π°Π»ΠΈΠ· запроса ΠΈ Ρ„ΠΎΡ€ΠΌΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅ комплСксных, Π³Π»ΡƒΠ±ΠΎΠΊΠΈΡ… ΠΎΡ‚Π²Π΅Ρ‚ΠΎΠ².

Π‘Π°Π·ΠΎΠ²Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΠ΅Ρ‚ΡΡ для повсСднСвных вопросов, позволяя Π²Ρ‹Π΄Π°Π²Π°Ρ‚ΡŒ быстрыС ΠΈ Ρ‚ΠΎΡ‡Π½Ρ‹Π΅ ΠΎΡ‚Π²Π΅Ρ‚Ρ‹ с минимальной Π·Π°Π΄Π΅Ρ€ΠΆΠΊΠΎΠΉ.

ΠŸΡ€ΠΎΡ†Π΅ΡΡ обучСния ΠΌΠΎΠ΄Π΅Π»ΠΈ устроСн ΠΏΠΎΡ…ΠΎΠΆΠΈΠΌ ΠΎΠ±Ρ€Π°Π·ΠΎΠΌ Π½Π° Ρ‚ΠΎ, ΠΊΠ°ΠΊ это сдСлано Π² DeepSeek R1.

ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ 119 языков, Π²ΠΊΠ»ΡŽΡ‡Π°Ρ русский.

Π›ΠΈΡ†Π΅Π½Π·ΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅: Apache 2.0 πŸ”₯

πŸ”œΠŸΠΎΠΏΡ€ΠΎΠ±ΠΎΠ²Π°Ρ‚ΡŒ: https://chat.qwen.ai/
πŸ”œBlog: https://qwenlm.github.io/blog/qwen3/
πŸ”œGitHub: https://github.com/QwenLM/Qwen3
πŸ”œHugging Face: https://huggingface.co/collections/Qwen/qwen3-67dd247413f0e2e4f653967f
πŸ”œ ModelScope: https://modelscope.cn/collections/Qwen3-9743180bdc6b48

@ai_machinelearning_big_data

#Qwen
Please open Telegram to view this post
VIEW IN TELEGRAM
Please open Telegram to view this post
VIEW IN TELEGRAM



group-telegram.com/ai_machinelearning_big_data/7469
Create:
Last Update:

πŸ”₯ Π Π΅Π»ΠΈΠ· Qwen 3 ΠΎΡ‚ Alibaba

Π’ Ρ€Π΅Π»ΠΈΠ· вошли 2 MoE-ΠΌΠΎΠ΄Π΅Π»ΠΈ ΠΈ 6 Dense models (ΠΏΠ»ΠΎΡ‚Π½Ρ‹Π΅ ΠΌΠΎΠ΄Π΅Π»ΠΈ), Ρ€Π°Π·ΠΌΠ΅Ρ€ΠΎΠΌ ΠΎΡ‚ 0.6B Π΄ΠΎ 235B ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².

πŸ† Ѐлагманская модСль Qwen3-235B-A22B дСмонстрируСт ΠΊΠΎΠ½ΠΊΡƒΡ€Π΅Π½Ρ‚Π½Ρ‹Π΅ Ρ€Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚Ρ‹ Π² Π·Π°Π΄Π°Ρ‡Π°Ρ… Кодина, ΠΌΠ°Ρ‚Π΅ΠΌΠ°Ρ‚ΠΈΠΊΠΈ ΠΈ ΠΎΠ±Ρ‰ΠΈΡ… способностСй, ΡƒΠ²Π΅Ρ€Π΅Π½Π½ΠΎ сопСрничая с ΠΏΠ΅Ρ€Π΅Π΄ΠΎΠ²Ρ‹ΠΌΠΈ модСлями, Ρ‚Π°ΠΊΠΈΠΌΠΈ ΠΊΠ°ΠΊ DeepSeek-R1, o1, o3-mini, Grok-3 ΠΈ Gemini-2.5-Pro.
⚑ НСбольшая MoE-модСль Qwen3-30B-A3B прСвосходит QwQ-32B,  ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡŽ Π² 10 Ρ€Π°Π· мСньшС ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².
πŸ”₯ ΠšΠΎΠΌΠΏΠ°ΠΊΡ‚Π½Π°Ρ модСль Qwen3-4B сопоставима ΠΏΠΎ ΠΏΡ€ΠΎΠΈΠ·Π²ΠΎΠ΄ΠΈΡ‚Π΅Π»ΡŒΠ½ΠΎΡΡ‚ΠΈ с Qwen2.5-72B-Instruct.
🧠 ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ Π³ΠΈΠ±Ρ€ΠΈΠ΄Π½Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ

Π Π΅ΠΆΠΈΠΌ Ρ€Π°Π·ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ активируСтся ΠΏΡ€ΠΈ ΠΎΠ±Ρ€Π°Π±ΠΎΡ‚ΠΊΠ΅ слоТных Π·Π°Π΄Π°Ρ‡, обСспСчивая ΠΏΠΎΡˆΠ°Π³ΠΎΠ²Ρ‹ΠΉ Π°Π½Π°Π»ΠΈΠ· запроса ΠΈ Ρ„ΠΎΡ€ΠΌΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅ комплСксных, Π³Π»ΡƒΠ±ΠΎΠΊΠΈΡ… ΠΎΡ‚Π²Π΅Ρ‚ΠΎΠ².

Π‘Π°Π·ΠΎΠ²Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΠ΅Ρ‚ΡΡ для повсСднСвных вопросов, позволяя Π²Ρ‹Π΄Π°Π²Π°Ρ‚ΡŒ быстрыС ΠΈ Ρ‚ΠΎΡ‡Π½Ρ‹Π΅ ΠΎΡ‚Π²Π΅Ρ‚Ρ‹ с минимальной Π·Π°Π΄Π΅Ρ€ΠΆΠΊΠΎΠΉ.

ΠŸΡ€ΠΎΡ†Π΅ΡΡ обучСния ΠΌΠΎΠ΄Π΅Π»ΠΈ устроСн ΠΏΠΎΡ…ΠΎΠΆΠΈΠΌ ΠΎΠ±Ρ€Π°Π·ΠΎΠΌ Π½Π° Ρ‚ΠΎ, ΠΊΠ°ΠΊ это сдСлано Π² DeepSeek R1.

ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ 119 языков, Π²ΠΊΠ»ΡŽΡ‡Π°Ρ русский.

Π›ΠΈΡ†Π΅Π½Π·ΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅: Apache 2.0 πŸ”₯

πŸ”œΠŸΠΎΠΏΡ€ΠΎΠ±ΠΎΠ²Π°Ρ‚ΡŒ: https://chat.qwen.ai/
πŸ”œBlog: https://qwenlm.github.io/blog/qwen3/
πŸ”œGitHub: https://github.com/QwenLM/Qwen3
πŸ”œHugging Face: https://huggingface.co/collections/Qwen/qwen3-67dd247413f0e2e4f653967f
πŸ”œ ModelScope: https://modelscope.cn/collections/Qwen3-9743180bdc6b48

@ai_machinelearning_big_data

#Qwen

BY Machinelearning





Share with your friend now:
group-telegram.com/ai_machinelearning_big_data/7469

View MORE
Open in Telegram


Telegram | DID YOU KNOW?

Date: |

"Your messages about the movement of the enemy through the official chatbot … bring new trophies every day," the government agency tweeted. DFR Lab sent the image through Microsoft Azure's Face Verification program and found that it was "highly unlikely" that the person in the second photo was the same as the first woman. The fact-checker Logically AI also found the claim to be false. The woman, Olena Kurilo, was also captured in a video after the airstrike and shown to have the injuries. Either way, Durov says that he withdrew his resignation but that he was ousted from his company anyway. Subsequently, control of the company was reportedly handed to oligarchs Alisher Usmanov and Igor Sechin, both allegedly close associates of Russian leader Vladimir Putin. Given the pro-privacy stance of the platform, it’s taken as a given that it’ll be used for a number of reasons, not all of them good. And Telegram has been attached to a fair few scandals related to terrorism, sexual exploitation and crime. Back in 2015, Vox described Telegram as β€œISIS’ app of choice,” saying that the platform’s real use is the ability to use channels to distribute material to large groups at once. Telegram has acted to remove public channels affiliated with terrorism, but Pavel Durov reiterated that he had no business snooping on private conversations. Continuing its crackdown against entities allegedly involved in a front-running scam using messaging app Telegram, Sebi on Thursday carried out search and seizure operations at the premises of eight entities in multiple locations across the country.
from jp


Telegram Machinelearning
FROM American