Telegram Group & Telegram Channel
πŸ”₯ Π Π΅Π»ΠΈΠ· Qwen 3 ΠΎΡ‚ Alibaba

Π’ Ρ€Π΅Π»ΠΈΠ· вошли 2 MoE-ΠΌΠΎΠ΄Π΅Π»ΠΈ ΠΈ 6 Dense models (ΠΏΠ»ΠΎΡ‚Π½Ρ‹Π΅ ΠΌΠΎΠ΄Π΅Π»ΠΈ), Ρ€Π°Π·ΠΌΠ΅Ρ€ΠΎΠΌ ΠΎΡ‚ 0.6B Π΄ΠΎ 235B ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².

πŸ† Ѐлагманская модСль Qwen3-235B-A22B дСмонстрируСт ΠΊΠΎΠ½ΠΊΡƒΡ€Π΅Π½Ρ‚Π½Ρ‹Π΅ Ρ€Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚Ρ‹ Π² Π·Π°Π΄Π°Ρ‡Π°Ρ… Кодина, ΠΌΠ°Ρ‚Π΅ΠΌΠ°Ρ‚ΠΈΠΊΠΈ ΠΈ ΠΎΠ±Ρ‰ΠΈΡ… способностСй, ΡƒΠ²Π΅Ρ€Π΅Π½Π½ΠΎ сопСрничая с ΠΏΠ΅Ρ€Π΅Π΄ΠΎΠ²Ρ‹ΠΌΠΈ модСлями, Ρ‚Π°ΠΊΠΈΠΌΠΈ ΠΊΠ°ΠΊ DeepSeek-R1, o1, o3-mini, Grok-3 ΠΈ Gemini-2.5-Pro.
⚑ НСбольшая MoE-модСль Qwen3-30B-A3B прСвосходит QwQ-32B,  ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡŽ Π² 10 Ρ€Π°Π· мСньшС ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².
πŸ”₯ ΠšΠΎΠΌΠΏΠ°ΠΊΡ‚Π½Π°Ρ модСль Qwen3-4B сопоставима ΠΏΠΎ ΠΏΡ€ΠΎΠΈΠ·Π²ΠΎΠ΄ΠΈΡ‚Π΅Π»ΡŒΠ½ΠΎΡΡ‚ΠΈ с Qwen2.5-72B-Instruct.
🧠 ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ Π³ΠΈΠ±Ρ€ΠΈΠ΄Π½Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ

Π Π΅ΠΆΠΈΠΌ Ρ€Π°Π·ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ активируСтся ΠΏΡ€ΠΈ ΠΎΠ±Ρ€Π°Π±ΠΎΡ‚ΠΊΠ΅ слоТных Π·Π°Π΄Π°Ρ‡, обСспСчивая ΠΏΠΎΡˆΠ°Π³ΠΎΠ²Ρ‹ΠΉ Π°Π½Π°Π»ΠΈΠ· запроса ΠΈ Ρ„ΠΎΡ€ΠΌΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅ комплСксных, Π³Π»ΡƒΠ±ΠΎΠΊΠΈΡ… ΠΎΡ‚Π²Π΅Ρ‚ΠΎΠ².

Π‘Π°Π·ΠΎΠ²Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΠ΅Ρ‚ΡΡ для повсСднСвных вопросов, позволяя Π²Ρ‹Π΄Π°Π²Π°Ρ‚ΡŒ быстрыС ΠΈ Ρ‚ΠΎΡ‡Π½Ρ‹Π΅ ΠΎΡ‚Π²Π΅Ρ‚Ρ‹ с минимальной Π·Π°Π΄Π΅Ρ€ΠΆΠΊΠΎΠΉ.

ΠŸΡ€ΠΎΡ†Π΅ΡΡ обучСния ΠΌΠΎΠ΄Π΅Π»ΠΈ устроСн ΠΏΠΎΡ…ΠΎΠΆΠΈΠΌ ΠΎΠ±Ρ€Π°Π·ΠΎΠΌ Π½Π° Ρ‚ΠΎ, ΠΊΠ°ΠΊ это сдСлано Π² DeepSeek R1.

ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ 119 языков, Π²ΠΊΠ»ΡŽΡ‡Π°Ρ русский.

Π›ΠΈΡ†Π΅Π½Π·ΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅: Apache 2.0 πŸ”₯

πŸ”œΠŸΠΎΠΏΡ€ΠΎΠ±ΠΎΠ²Π°Ρ‚ΡŒ: https://chat.qwen.ai/
πŸ”œBlog: https://qwenlm.github.io/blog/qwen3/
πŸ”œGitHub: https://github.com/QwenLM/Qwen3
πŸ”œHugging Face: https://huggingface.co/collections/Qwen/qwen3-67dd247413f0e2e4f653967f
πŸ”œ ModelScope: https://modelscope.cn/collections/Qwen3-9743180bdc6b48

@ai_machinelearning_big_data

#Qwen
Please open Telegram to view this post
VIEW IN TELEGRAM
Please open Telegram to view this post
VIEW IN TELEGRAM



group-telegram.com/ai_machinelearning_big_data/7469
Create:
Last Update:

πŸ”₯ Π Π΅Π»ΠΈΠ· Qwen 3 ΠΎΡ‚ Alibaba

Π’ Ρ€Π΅Π»ΠΈΠ· вошли 2 MoE-ΠΌΠΎΠ΄Π΅Π»ΠΈ ΠΈ 6 Dense models (ΠΏΠ»ΠΎΡ‚Π½Ρ‹Π΅ ΠΌΠΎΠ΄Π΅Π»ΠΈ), Ρ€Π°Π·ΠΌΠ΅Ρ€ΠΎΠΌ ΠΎΡ‚ 0.6B Π΄ΠΎ 235B ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².

πŸ† Ѐлагманская модСль Qwen3-235B-A22B дСмонстрируСт ΠΊΠΎΠ½ΠΊΡƒΡ€Π΅Π½Ρ‚Π½Ρ‹Π΅ Ρ€Π΅Π·ΡƒΠ»ΡŒΡ‚Π°Ρ‚Ρ‹ Π² Π·Π°Π΄Π°Ρ‡Π°Ρ… Кодина, ΠΌΠ°Ρ‚Π΅ΠΌΠ°Ρ‚ΠΈΠΊΠΈ ΠΈ ΠΎΠ±Ρ‰ΠΈΡ… способностСй, ΡƒΠ²Π΅Ρ€Π΅Π½Π½ΠΎ сопСрничая с ΠΏΠ΅Ρ€Π΅Π΄ΠΎΠ²Ρ‹ΠΌΠΈ модСлями, Ρ‚Π°ΠΊΠΈΠΌΠΈ ΠΊΠ°ΠΊ DeepSeek-R1, o1, o3-mini, Grok-3 ΠΈ Gemini-2.5-Pro.
⚑ НСбольшая MoE-модСль Qwen3-30B-A3B прСвосходит QwQ-32B,  ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡŽ Π² 10 Ρ€Π°Π· мСньшС ΠΏΠ°Ρ€Π°ΠΌΠ΅Ρ‚Ρ€ΠΎΠ².
πŸ”₯ ΠšΠΎΠΌΠΏΠ°ΠΊΡ‚Π½Π°Ρ модСль Qwen3-4B сопоставима ΠΏΠΎ ΠΏΡ€ΠΎΠΈΠ·Π²ΠΎΠ΄ΠΈΡ‚Π΅Π»ΡŒΠ½ΠΎΡΡ‚ΠΈ с Qwen2.5-72B-Instruct.
🧠 ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ Π³ΠΈΠ±Ρ€ΠΈΠ΄Π½Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ

Π Π΅ΠΆΠΈΠΌ Ρ€Π°Π·ΠΌΡ‹ΡˆΠ»Π΅Π½ΠΈΡ активируСтся ΠΏΡ€ΠΈ ΠΎΠ±Ρ€Π°Π±ΠΎΡ‚ΠΊΠ΅ слоТных Π·Π°Π΄Π°Ρ‡, обСспСчивая ΠΏΠΎΡˆΠ°Π³ΠΎΠ²Ρ‹ΠΉ Π°Π½Π°Π»ΠΈΠ· запроса ΠΈ Ρ„ΠΎΡ€ΠΌΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅ комплСксных, Π³Π»ΡƒΠ±ΠΎΠΊΠΈΡ… ΠΎΡ‚Π²Π΅Ρ‚ΠΎΠ².

Π‘Π°Π·ΠΎΠ²Ρ‹ΠΉ Ρ€Π΅ΠΆΠΈΠΌ ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΠ΅Ρ‚ΡΡ для повсСднСвных вопросов, позволяя Π²Ρ‹Π΄Π°Π²Π°Ρ‚ΡŒ быстрыС ΠΈ Ρ‚ΠΎΡ‡Π½Ρ‹Π΅ ΠΎΡ‚Π²Π΅Ρ‚Ρ‹ с минимальной Π·Π°Π΄Π΅Ρ€ΠΆΠΊΠΎΠΉ.

ΠŸΡ€ΠΎΡ†Π΅ΡΡ обучСния ΠΌΠΎΠ΄Π΅Π»ΠΈ устроСн ΠΏΠΎΡ…ΠΎΠΆΠΈΠΌ ΠΎΠ±Ρ€Π°Π·ΠΎΠΌ Π½Π° Ρ‚ΠΎ, ΠΊΠ°ΠΊ это сдСлано Π² DeepSeek R1.

ΠŸΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΈΠ²Π°Π΅Ρ‚ 119 языков, Π²ΠΊΠ»ΡŽΡ‡Π°Ρ русский.

Π›ΠΈΡ†Π΅Π½Π·ΠΈΡ€ΠΎΠ²Π°Π½ΠΈΠ΅: Apache 2.0 πŸ”₯

πŸ”œΠŸΠΎΠΏΡ€ΠΎΠ±ΠΎΠ²Π°Ρ‚ΡŒ: https://chat.qwen.ai/
πŸ”œBlog: https://qwenlm.github.io/blog/qwen3/
πŸ”œGitHub: https://github.com/QwenLM/Qwen3
πŸ”œHugging Face: https://huggingface.co/collections/Qwen/qwen3-67dd247413f0e2e4f653967f
πŸ”œ ModelScope: https://modelscope.cn/collections/Qwen3-9743180bdc6b48

@ai_machinelearning_big_data

#Qwen

BY Machinelearning





Share with your friend now:
group-telegram.com/ai_machinelearning_big_data/7469

View MORE
Open in Telegram


Telegram | DID YOU KNOW?

Date: |

Telegram has become more interventionist over time, and has steadily increased its efforts to shut down these accounts. But this has also meant that the company has also engaged with lawmakers more generally, although it maintains that it doesn’t do so willingly. For instance, in September 2021, Telegram reportedly blocked a chat bot in support of (Putin critic) Alexei Navalny during Russia’s most recent parliamentary elections. Pavel Durov was quoted at the time saying that the company was obliged to follow a β€œlegitimate” law of the land. He added that as Apple and Google both follow the law, to violate it would give both platforms a reason to boot the messenger from its stores. On December 23rd, 2020, Pavel Durov posted to his channel that the company would need to start generating revenue. In early 2021, he added that any advertising on the platform would not use user data for targeting, and that it would be focused on β€œlarge one-to-many channels.” He pledged that ads would be β€œnon-intrusive” and that most users would simply not notice any change. The regulator said it has been undertaking several campaigns to educate the investors to be vigilant while taking investment decisions based on stock tips. One thing that Telegram now offers to all users is the ability to β€œdisappear” messages or set remote deletion deadlines. That enables users to have much more control over how long people can access what you’re sending them. Given that Russian law enforcement officials are reportedly (via Insider) stopping people in the street and demanding to read their text messages, this could be vital to protect individuals from reprisals. This provided opportunity to their linked entities to offload their shares at higher prices and make significant profits at the cost of unsuspecting retail investors.
from nl


Telegram Machinelearning
FROM American