Skip to content
  • Categories
  • Newsletter
  • Recent
  • AI Insights
  • Tags
  • Popular
  • World
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
  1. Home
  2. AI Insights
  3. Tongyi Qianwen 72B Model Tops the Large Model Evaluation Platform Rankings
uSpeedo.ai - AI marketing assistant
Try uSpeedo.ai — Boost your marketing

Tongyi Qianwen 72B Model Tops the Large Model Evaluation Platform Rankings

Scheduled Pinned Locked Moved AI Insights
techinteligencia-ar
1 Posts 1 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • baoshi.raoB Offline
    baoshi.raoB Offline
    baoshi.rao
    wrote on last edited by
    #1

    China's authoritative large model evaluation platform OpenCompass recently updated its rankings, with the Tongyi Qianwen 72B model taking the top spot with a high score of 67.1.

    OpenCompass is an open-source large model evaluation platform launched by the Shanghai Artificial Intelligence Laboratory. Its evaluation scope covers five dimensions: disciplines, language, knowledge, comprehension, and reasoning, providing a comprehensive assessment of large model capabilities.

    In OpenCompass's Chinese dataset evaluations, the Qwen-72B base large model and the conversational large model (Qwen-72B-Chat) secured the top two positions, significantly outperforming other models.

    WeChat Screenshot_20231213113631.png

    In early December, Alibaba Cloud announced the open-sourcing of its 72-billion-parameter large language model, Qwen-72B. Qwen-72B has achieved the best performance among open-source models in ten authoritative benchmark evaluations, making it the most powerful open-source large model in the industry. Its performance surpasses the open-source benchmark model Llama2-70B and most commercial closed-source models, making it suitable for enterprise-level and research-level high-performance applications.

    It is reported that Qwen-72B can process text inputs of up to 32k in length and outperforms ChatGPT-3.5-16k on the long-text understanding test set LEval.

    1 Reply Last reply
    0
    Reply
    • Reply as topic
    Log in to reply
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes


    • Login

    • Don't have an account? Register

    • Login or register to search.
    • First post
      Last post
    0
    • Categories
    • Newsletter
    • Recent
    • AI Insights
    • Tags
    • Popular
    • World
    • Groups