AI Guide

AI & Tech

DeepSeek Applications and Strategic Thinking

DeepSeek is now being smoothly applied in cross-border e-commerce.
The success of "Nezha 2" and DeepSeek shares a venture capital-style mindset. In the face of complex and rapidly changing environments, enterprises and individuals should learn from the venture capital industry, actively embrace uncertainty, dare to take risks, not be afraid of failure, and be brave to innovate.

DeepSeek's Open-Source Matrix Multiplication Library: DeepGEMM

DeepSeek has open-sourced DeepGEMM, a matrix multiplication library optimized for the Hopper architecture GPU, supporting standard matrix calculations and Mixture of Experts (MoE) calculations.
This library provides strong support for the training and inference of DeepSeek-V3/R1.
The core code is only about 300 lines and outperforms existing solutions in most matrix sizes.
DeepGEMM utilizes HopperGPU's tensor cores and TMA technology to optimize FP8 precision matrix multiplication.
In standard matrix multiplication, DeepGEMM is 1.0 to 2.7 times faster than optimized implementations based on CUTLASS3.6. In Mixture of Experts model calculations, continuous arrangement is accelerated by about 1.1 to 1.2 times, and mask arrangement is also accelerated by 1.1 to 1.2 times.

News List

36kr.AI

1 hours ago

CEO锦囊·出海季：当跨境电商遇上DeepSeek，赚麻了？

DeepSeek已经丝滑地应用到跨境电商中了。

虎嗅.最新

1 hours ago

一文详解：DeepSeek刚开源的DeepGEMM是怎么回事？

DeepSeek开源了专为Hopper架构GPU优化的矩阵乘法库DeepGEMM，支持标准矩阵计算和混合专家模型（MoE）计算，为DeepSeek-V3/R1的训练和推理提供强大支持。该库核心代码仅约300行，在大多数矩阵尺寸下性能优于现有解决方案，采用即时编译技术，无需安装时编译，代码结构清晰易懂。在标准矩阵乘法中，DeepGEMM相比基于CUTLASS3.6的优化实现，速度提升1.0到2.7倍不等；在混合专家模型计算中，连续排列方式提速约1.1到1.2倍，掩码排列方式也能提速1.1到1.2倍。DeepGEMM利用HopperGPU的张量核心和TMA技术，优化FP8精度矩阵乘法，确保计算结果准确可靠。

虎嗅.最新

1 hours ago

风投式思维：哪吒2和DeepSeek背后的共同思维模式

《哪吒2》和DeepSeek的成功，共享着风投式思维模式。传统上经营企业和研发产品的那种渐进、连续且保守的方式早已不再适用，面对复杂且快速变化的环境，企业和个人都应该学习风险投资行业，积极拥抱不确定性，敢于承担风险，不惧失败，勇于创新。风投式思维具有接受不确定性、鼓励创新、将失败视为必需、大胆决策、保持敏捷行动、注重长期发展等核心特征，根据这种思维原则，失败虽有成本，但和一旦成功所获得的巨大收益相比，失败的成本其实非常低。

AI Guide

Tech Industry Updates

New Deep Learning Models Released

Several tech giants have recently launched new deep reasoning models. This surge in model development is indicative of the growing competition in the artificial intelligence sector, where firms are investing heavily in advanced technologies to enhance their offerings.

Exponential Financing and Valuation Increases

In conjunction with these new model releases, companies are experiencing excess financing, leading to a remarkable valuation increase of up to two times. The influx of capital reflects investors' confidence in the potential of AI and its applications across various industries.

Built-in JIT and Clean Implementation

The latest models feature built-in Just-In-Time (JIT) compilation capabilities, showcasing a commitment to efficiency and performance. The implementation is noted for its clean design, making it easier for developers to integrate and utilize these new technologies.

News List

36kr.AI

4 hours ago

DeepSeek开源第三弹，极致榨干GPU，FP8训推秘籍公开

内置JIT，像教程一样干净！

36kr.AI

4 hours ago

DeepSeek头号黑粉这下爽到了

发新模型，超额融资，估值增加两倍

36kr.AI

4 hours ago

传DeepSeek R2提速，字节豆包灰测深度思考，微软Copilot已免费开放

科技巨头扎堆上新深度推理模型。

AI Guide

AI

AI Model Applications

The focus is on AI large model applications.

Finance

Biren Technology IPO

Biren Technology is considering a Hong Kong IPO and plans to raise $300 million.

News List

36kr.AI

5 hours ago

DeepSeek风口，AI算力独角兽IPO提速？

壁仞科技考虑港股IPO，拟募资3亿美元。

36kr.AI

5 hours ago

拼多多AI大模型暗战

侧重AI大模型应用。

AI Guide

AI & Machine Learning

Open Source Trends in AI

The trend of AI open source is emerging, with inference models becoming mainstream.

DeepSeek's Impact and Future AI Development

Redpoint Ventures partner Jacob Effron and David Luan discussed DeepSeek's implications for the large model space, suggesting its success indicates a shift towards improving model efficiency. Luan anticipates that large "teacher models" will be trained internally and then compressed for client use. The increasing complexity of AI applications, from simple chat to drug discovery, requires more intelligent models, with reinforcement learning playing a key role in enhancing agent intelligence.

AI Search and Advertising

Controversy arose around AI search tool DeepSeek due to suspected advertising within its answers, showing bias in recommending purchasing channels. This has led to discussions about the possibility of advertising integration in AI search. While technically feasible and seen as valuable by advertisers, user acceptance, content accuracy, and privacy issues remain challenges. Domestic AI search platforms are cautious about incorporating ads.

News List

虎嗅.最新

6 hours ago

广告盯上DeepSeek

近日，AI搜索工具DeepSeek因答案中疑似“夹带”广告引发争议，用户发现其在推荐购买渠道时存在倾向性，甚至有商家借DeepSeek名义推销商品，可能误导用户。尽管腾讯否认了相关指控，但AI搜索中广告植入的可能性引发关注。技术上，AI搜索加广告不难实现，国外已有先例，如Perplexity AI。广告商看好AI搜索的精准性和用户价值，但用户接受度、内容准确性和隐私问题是潜在挑战。目前国内AI搜索平台对广告接入持谨慎态度，用户需自行辨别AI搜索结果中的广告内容。

36kr.AI

6 hours ago

DeepSeek 闯进更难的第二关

商业化之战

虎嗅.最新

6 hours ago

OpenAI早期员工David Luan最新访谈：DeepSeek并未改变AI技术的叙事

红点创投合伙人Jacob Effron与David Luan探讨了DeepSeek对大模型领域的启示，认为其成功表明AI发展重心已转向提升模型效率。Luan强调，即使模型更高效，对智能的追求也不会停止。他预测，未来大型“教师模型”将在内部实验室训练，再压缩成高效模型供客户使用。Luan还提到，人工智能的应用场景复杂度不断提升，从简单聊天到复杂任务如药物发现，都需要更智能的模型。他认为强化学习在提升Agent智能方面具有重要价值。

36kr.AI

6 hours ago

广告盯上DeepSeek

AI搜索加广告，是必然吗？

36kr.AI

6 hours ago

DeepSeek开源周才第二天，有些公司就已经坐不住了

AI开源潮涌现，推理模型成主流。

AI Guide

AI Research Assistants

OpenAI Expands Deep Research AI Agent

OpenAI has broadened access to its powerful Deep Research AI agent, now available to ChatGPT Plus, Team, Education, and Enterprise users. This expansion is expected to heighten competition with other players like DeepSeek and Anthropic within the swiftly advancing AI research assistant market.

DeepSeek's Open Source Success and Commercialization Challenges

DeepSeek, having achieved a successful 1.0 phase as an open-source software, has attracted major tech companies such as Tencent, Baidu, and Alibaba. However, it is facing challenges in its commercialization strategy. Despite having substantial funding, the need for a solid business model is crucial for leveraging its technological advancements effectively. This analysis explores three business models for open-source technology:

Google's approach of “establishing rules” to control core services like GMS.
MySQL's model of offering an enterprise version alongside a partially open-source product.
The “open source + product/service” model exemplified by Chrome and RedHat, which may serve as a primary model for DeepSeek: generating revenue through services rather than direct model sales. The focus is on converting user numbers into value through branding and service offerings.

DeepSeek Launches Open Source Week

DeepSeek has kicked off its “Open Source Week,” which highlights the importance of open-source development and its implications. This initiative could hold significant relevance for developers and organizations engaged in open-source projects, fostering collaboration and innovation within the tech community.

News List

虎嗅.最新

7 hours ago

DeepSeek闯进更难的第二关

DeepSeek作为开源软件已实现1.0阶段的成功，吸引腾讯、百度、阿里等大厂接入。但其商业化模式面临挑战，尽管幻方不缺钱，但好的商业模式是技术飞轮的必要条件。文章分析了三种开源技术的商业模式：一是如谷歌安卓通过“建立规则”控制GMS核心服务；二是如MySQL通过部分开源提供企业版本；三是如Chrome和RedHat通过“开源+产品/服务”模式，其中Chrome和RedHat的模式可能成为DeepSeek的主要商业模式：模型不赚钱，用服务赚钱。通过品牌和服务，将用户数转化为价值。

36kr.AI

7 hours ago

DeepSeek开启“开源周”，与我们有什么关系？

VentureBeat

7 hours ago

OpenAI drops Deep Research access to Plus users, heating up AI agent wars with DeepSeek and Claude

OpenAI expands its powerful Deep Research AI agent to ChatGPT Plus, Team, Education, and Enterprise users, intensifying competition with DeepSeek and Anthropic in the rapidly evolving AI research assistant market.

AI Guide

Artificial Intelligence

OpenAI DeepResearch Expansion

OpenAI has broadened access to its DeepResearch AI agent, now available to ChatGPT Plus, Team, Edu, and Enterprise users. Pro users receive 120 deep research queries monthly, while other users get 10. DeepResearch, leveraging a version of the o3 model, conducts multi-step research by collecting web information and synthesizing it into detailed reports. It can search and interpret texts and images, generating reports in 5-30 minutes, complete with citations and summaries of its reasoning, a process OpenAI claims would take humans hours.

News List

ZDNet.AI

10 hours ago

OpenAI’s Deep Research can save you hours of work – and now it’s a lot cheaper to access

OpenAI has expanded access to its DeepResearch AI agent to ChatGPT Plus, Team, Edu, and Enterprise users, with Pro users having 120 deep research queries per month while the other users have 10 queries per month. Powered by a version of the o3 model, DeepResearch conducts multi-step research by gathering information from the web and synthesizing it into a comprehensive report. It can search and interpret texts and images and generate a report in 5-30 minutes, including citations and a summary of its thinking. OpenAI says this would take humans hours.

AI Guide

AI

OpenAI DeepResearch Expansion and Features

OpenAI is expanding its DeepResearch AI agent, initially for ChatGPT Pro, to Plus, Team, Edu, and Enterprise users. This tool enables ChatGPT to conduct in-depth research and synthesize information into comprehensive reports within 5-30 minutes, citing sources. Plus users receive 10 queries monthly, while Pro users get 120. The tool now includes embedded images with citations and improved document analysis.

OpenAI's Decision to Withhold DeepResearch API

OpenAI will not release the DeepResearch AI model via its developer API due to concerns about AI's persuasive capabilities and potential for spreading misinformation. The company is revising its methods for assessing “real-world persuasion risks.” Though deemed unsuitable for mass disinformation campaigns due to cost and speed, OpenAI will investigate factors like personalized persuasive content, as tests showed the model could write persuasive arguments, but not better than humans. DeepResearch is powered by a version of the o3 model.

News List

ZDNet.AI

11 hours ago

OpenAI’s Deep Research agent can do in 5 minutes what takes you hours – and now it’s a lot cheaper

OpenAI has launched DeepResearch, an AI agent in ChatGPT that conducts in-depth research and synthesizes information into comprehensive reports. Initially for ChatGPT Pro users, it’s now rolling out to Plus, Team, Edu, and Enterprise users, with varying query limits. Powered by a version of the o3 model, DeepResearch can analyze vast amounts of web content, including texts and images, and generate reports in 5-30 minutes. The reports include citations and summaries. OpenAI is withholding the model from its developer API to assess risks of AI convincing people.

TechCrunch

11 hours ago

Why OpenAI isn’t bringing deep research to its API just yet

OpenAI will not bring the AI model powering DeepResearch to its developer API due to concerns about AI’s ability to persuade and potentially spread misinformation. The company is revising its methods for assessing “real-world persuasion risks.” While OpenAI believes DeepResearch is not suited for mass disinformation campaigns due to its computing costs and speed, it intends to explore factors like personalized persuasive content. Tests showed the DeepResearch model performed well in writing persuasive arguments, but not better than humans.

Engadget

11 hours ago

OpenAI expands Deep Research to all paying ChatGPT users

OpenAI is rolling out its DeepResearch tool to ChatGPT Plus, Team, Edu, and Enterprise users, after initially launching it for Pro users. This feature allows ChatGPT to create in-depth reports on various subjects. Plus users will get 10 DeepResearch queries per month, while Pro subscribers now have 120. OpenAI has also improved the tool by embedding images with citations and enhancing document analysis. Users can access DeepResearch by tapping the icon before sending a request to OpenAI.

AI Guide

AI Development & Applications

AI Model Updates and Competition

Chinese AI startup DeepSeek has reopened its API after a three-week pause due to capacity issues. Their R1 "reasoning" model rivals OpenAI's. Meanwhile, Alibaba launched a preview of its QwQ-Max reasoning AI model, planning to open-source it, indicating growing competition in the Chinese AI market.

AI Tools in Research

The author shares their evolving perspective on AI tools, embracing them for research purposes. They use Ollama with the Llama3.2 LLM for quick answers and DeepSeekR1 with the Mysty GUI for in-depth research, significantly enhancing their research process.

Workplace Dynamics

The Importance of Workplace Relationships

Employees who feel connected and can collaborate are more likely to succeed and be happy. Social connections increase life satisfaction and well-being, while isolation can lead to burnout. Hybrid work models pose challenges to maintaining meaningful workplace relationships, requiring future solutions.

News List

The Verge

16 hours ago

How AI PCs are removing barriers to workplace connection

The importance of relationships in the workplace is highlighted, emphasizing that employees who feel connected and can collaborate are more likely to succeed and be happy. Research indicates that social connections increase life satisfaction and well-being, while isolation can lead to burnout. Employee happiness is linked to better work outcomes and success. The rise of hybrid work models poses challenges to maintaining meaningful workplace relationships, with most businesses planning to continue with remote work options. Many employees struggle to maintain meaningful connections in this new professional landscape, and future solutions need to address this issue.

ZDNet.AI

16 hours ago

I was an AI skeptic until until these 5 tools changed my mind

The author shares their evolving perspective on AI tools, moving from initial skepticism to embracing them for specific purposes like research. They highlight the efficiency of AI in quickly understanding complex concepts, replacing traditional search engines in their workflow. The author primarily uses Ollama, a command-line AI tool, with the Llama3.2 LLM for fast and concise answers. They also use DeepSeekR1 with the Mysty GUI for more in-depth research, particularly when exploring complex topics for creative projects. The combination of these tools has significantly enhanced their research process.

TechCrunch

16 hours ago

DeepSeek reopens access to its API after three-week pause

Chinese AI startup DeepSeek has reopened its API after a three-week halt due to capacity constraints. Customers can now top up credits to use DeepSeek’s AI, but server resources remain strained during daytime. DeepSeek gained prominence with its R1 “reasoning” model, rivaling OpenAI’s models, prompting OpenAI to consider open-sourcing more technology. Meanwhile, Chinese tech giant Alibaba launched a preview of its latest reasoning AI model, QwQ-Max, planning to open-source it, indicating increasing competition in the Chinese AI market.

AI Guide

AI Model Updates

Anthropic's Claude 3.7 Sonnet

Anthropic has launched its first "hybrid model," Claude 3.7 Sonnet, integrating real-time responses with deep-thinking capabilities, allowing users to obtain various types of answers without switching models. The model excels in following instructions, general reasoning, multi-modal capabilities, and autonomous coding, with significant improvements in mathematics and science. Its coding abilities surpass DeepSeek R1 and OpenAI's o1 and o3 models.

Agent Claude Code

Anthropic has also released Agent Claude Code, designed specifically for coding tasks. It can run directly in the terminal, assisting developers with programming tasks.

Claude 3.7 Sonnet Availability and Pricing

Claude 3.7 Sonnet is now fully available, although the extended thinking mode is not accessible for free users, and its pricing is higher than competitors' pure reasoning models.

Anthropic's Funding

Anthropic is reportedly nearing the completion of a new $3.5 billion funding round, potentially valuing the company at $61.5 billion.

News List

虎嗅.最新

17 hours ago

DeepSeek头号黑粉这下爽到了

Anthropic发布首个“混合模型”Claude3.7Sonnet，该模型整合实时应答和深度思考，用户无需切换即可获得不同类型的答案。同时，传闻Anthropic接近完成35亿美元的新一轮融资，估值可能达到615亿美元。Claude3.7Sonnet在遵循指令、一般推理、多模态能力和自主编码方面表现出色，尤其在数学和科学方面有显著提升，代码能力大幅超越DeepSeek R1和OpenAI的o1、o3模型。此外，Anthropic还发布了专注于代码的Agent Claude Code，可以直接在终端运行，帮助开发者完成编程任务。Claude3.7Sonnet已全面上线，但免费用户无法使用扩展思考模式，定价高于竞争对手的纯推理模型。

AI Guide

AI & Technology

AI in Education

DeepSeek is emerging, but ChatGPT maintains its leading position in American universities.

The Impact of AI Companions

The rise of AI girlfriends may affect the birth rate.

Value Reconfiguration

News List

36kr.AI

18 hours ago

DeepSeek带飞万元AI女友：单身狗福音，生育率躺枪

AI 女友来 “抢戏”，单身狗的福音，生育率 “emo”？

36kr.AI

18 hours ago

DeepSeek 解决不了 KPI，那些被「卷」到失眠的金融人

价值重构

36kr.AI

19 hours ago

AI 教育的分野：ChatGPT 风靡美国高校，DeepSeek 称霸国内

DeepSeek崛起，ChatGPT在美国高校如何稳占C位？

AI Guide

Artificial Intelligence

AI Development & Open Source Initiatives

DeepSeek has open-sourced DeepEP, the world's first full-stack communication library for MoE models, aiming to address AI computing power issues. DeepEP optimizes NVLink technology to increase data transfer speeds between GPUs and utilizes RDMA technology to reduce data transfer latency. It also features intelligent sorting and FP8 data compression to further improve data processing efficiency. DeepSeek also released FlashMLA code during "Open Source Week" to reduce costs in large model training. Through these open-source technologies, DeepSeek is helping to reduce costs for the entire industry chain.

US Restrictions on China's AI

The impact of the White House ban on the rise of Chinese AI is a key consideration.

News List

虎嗅.最新

19 hours ago

DeepSeek扔的第二枚开源王炸到底是什么？

DeepSeek开源了全球首个面向MoE模型的全栈通信库DeepEP，旨在解决AI算力问题。DeepEP通过优化NVLink技术，提升了GPU之间的数据传输速度，并利用RDMA技术降低了数据传输延迟。此外，DeepEP还具备智能分拣功能和FP8数据压缩技术，进一步提高了数据处理效率。DeepSeek还在“开源周”发布了FlashMLA代码，也是为了减少大模型训练过程中的成本。通过开源这些技术，DeepSeek正在帮助产业链上下游降低成本。然而，中国MaaS模式可能面临亏损，因机器成本高昂。

36kr.AI

20 hours ago

DeepSeek爆火一个月，留给美国的时间不多了

白宫禁令还挡得住中国AI崛起吗？

AI Guide

Finance

DeepSeek Large Model Application in Banking

Banks are increasingly adopting the DeepSeek large model, but challenges remain, particularly in addressing technical hallucinations.

News List

36kr.AI

21 hours ago

15家银行集体押注，DeepSeek如何掀起金融AI革命？

银行业加速布局DeepSeek大模型，仍需解决技术幻觉问题。

AI Guide

AI and Investment

Anthropic's Valuation Soars

Anthropic's valuation has climbed to $61.5 billion.

Claude 3.7 Sonnet Release

Anthropic has released Claude 3.7 Sonnet, featuring a hybrid reasoning mode to enhance thinking control and computational capabilities.

Reasoning Mode

The new model allows switching between two thinking modes and offers precise control over thinking time.

News List

36kr.AI

22 hours ago

全球首个混合推理模型降世，程序员集体过年，最强AI编程秒全场，多平台火速接入

可切换两种思考模式，精准把控思考时间。

36kr.AI

22 hours ago

中文比 R1 丝滑、玩宝可梦还贼溜？全球首个混合推理模型 Claude 3.7 Sonnet 太惊艳，网友直呼“孤独求败”

Claude 3.7 Sonnet发布，具混合推理模式，增强思考控制和计算能力。

36kr.AI

23 hours ago

实测Claude 3.7：3200行代码一口气输出，物理规律手拿把掐，弱智吧已失守

Anthropic估值涨到615亿美元

AI Guide

AI and Technology Updates

DeepSeek's DeepEP Communication Library

DeepSeek has launched DeepEP, a communication library tailored for Mixture of Experts (MoE) and Expert Parallelism (EP) systems. This library aims to boost the efficiency of large-scale AI training and inference by optimizing GPU kernels, particularly in data routing and output integration within MoE models.

Key features of DeepEP include native support for FP8 intelligent compression transmission and flexible GPU resource management, making it suitable for resource-constrained or real-time applications. By minimizing data transfer wait times and enhancing GPU utilization, DeepEP aims to reduce costs and increase efficiency in areas like natural language processing, code generation, and recommendation systems.

News List

36kr.AI

1 days ago

小红书也要DeepSeek船票

必须更主动一点

虎嗅.最新

1 days ago

榨干每一块GPU，DeepSeek开源第二天，送上降本增效神器

DeepSeek发布DeepEP，一个专为混合专家系统（MoE）和专家并行（EP）定制的通信库，旨在提升大规模AI训练和推理的效率。DeepEP通过优化GPU内核，尤其是在MoE模型中的数据路由和输出整合过程，实现了高效的全员协作通道和低延迟内核。其亮点包括原生支持FP8智能压缩传输和灵活调控GPU资源，适用于资源受限或实时性要求高的场景。通过高速通道和无缝换乘，DeepEP显著减少了数据传输的等待时间，提升了GPU的利用率，从而在自然语言处理、代码生成和推荐系统等领域实现降本增效。