<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Zhipu AI Open-Sources Visual Language Model CogAgent with GUI Interface Q&amp;A Support]]></title><description><![CDATA[<p dir="auto">Zhipu AI has open-sourced CogAgent, a visual language model with 18 billion parameters. The model excels in GUI understanding and navigation, achieving state-of-the-art (SOTA) general performance on multiple benchmarks.</p>
<p dir="auto">It also supports high-resolution visual input and conversational Q&amp;A, and can answer questions about arbitrary GUI screenshots.</p>
<p dir="auto"><img src="https://s3-sg.ufileos.com/nodebb-test/spider_image/c11f744c-c742-464b-a188-5229d8bfe7e7.png" alt="WeChat Screenshot_20231221083343.png" class=" img-fluid img-markdown" /></p>
<p dir="auto">Given an uploaded screenshot and a task, the model can reason about the task and return a plan, the next action, and the specific coordinates of the operation.</p>
<p dir="auto">CogAgent also supports OCR-related tasks, with capabilities significantly improved through pre-training and fine-tuning.</p>
<p dir="auto"><strong>GitHub:</strong></p>
<p dir="auto"><a href="https://github.com/CogNLP/CogAGENT" rel="nofollow ugc">https://github.com/CogNLP/CogAGENT</a></p>
<p dir="auto"><strong>cogagent-chat:</strong></p>
<p dir="auto"><a href="https://modelscope.cn/models/ZhipuAI/cogagent-chat/summary" rel="nofollow ugc">https://modelscope.cn/models/ZhipuAI/cogagent-chat/summary</a></p>
<p dir="auto"><strong>cogagent-vqa:</strong></p>
<p dir="auto"><a href="https://www.modelscope.cn/models/ZhipuAI/cogagent-vqa/summary" rel="nofollow ugc">https://www.modelscope.cn/models/ZhipuAI/cogagent-vqa/summary</a></p>
]]></description><link>https://iacommunidad.com/topic/1839/zhipu-ai-open-sources-visual-language-model-cogagent-with-gui-interface-q-a-support</link><generator>RSS for Node</generator><lastBuildDate>Tue, 16 Jun 2026 15:50:55 GMT</lastBuildDate><atom:link href="https://iacommunidad.com/topic/1839.rss" rel="self" type="application/rss+xml"/><pubDate>Mon, 08 Sep 2025 03:02:57 GMT</pubDate><ttl>60</ttl></channel></rss>