AI Guide
AI & Tech
DeepSeek Applications and Strategic Thinking
- DeepSeek is now being smoothly applied in cross-border e-commerce.
- The success of "Nezha 2" and DeepSeek shares a venture capital-style mindset. In the face of complex and rapidly changing environments, enterprises and individuals should learn from the venture capital industry, actively embrace uncertainty, dare to take risks, not be afraid of failure, and be brave to innovate.
DeepSeek's Open-Source Matrix Multiplication Library: DeepGEMM
- DeepSeek has open-sourced DeepGEMM, a matrix multiplication library optimized for the Hopper architecture GPU, supporting standard matrix calculations and Mixture of Experts (MoE) calculations.
- This library provides strong support for the training and inference of DeepSeek-V3/R1.
- The core code is only about 300 lines and outperforms existing solutions in most matrix sizes.
- DeepGEMM utilizes HopperGPU's tensor cores and TMA technology to optimize FP8 precision matrix multiplication.
- In standard matrix multiplication, DeepGEMM is 1.0 to 2.7 times faster than optimized implementations based on CUTLASS3.6. In Mixture of Experts model calculations, continuous arrangement is accelerated by about 1.1 to 1.2 times, and mask arrangement is also accelerated by 1.1 to 1.2 times.
News List
36kr.AI
1 hours ago
虎嗅.最新
1 hours ago
一文详解:DeepSeek刚开源的DeepGEMM是怎么回事?
DeepSeek开源了专为Hopper架构GPU优化的矩阵乘法库DeepGEMM,支持标准矩阵计算和混合专家模型(MoE)计算,为DeepSeek-V3/R1的训练和推理提供强大支持。该库核心代码仅约300行,在大多数矩阵尺寸下性能优于现有解决方案,采用即时编译技术,无需安装时编译,代码结构清晰易懂。在标准矩阵乘法中,DeepGEMM相比基于CUTLASS3.6的优化实现,速度提升1.0到2.7倍不等;在混合专家模型计算中,连续排列方式提速约1.1到1.2倍,掩码排列方式也能提速1.1到1.2倍。DeepGEMM利用HopperGPU的张量核心和TMA技术,优化FP8精度矩阵乘法,确保计算结果准确可靠。
虎嗅.最新
1 hours ago
风投式思维:哪吒2和DeepSeek背后的共同思维模式
《哪吒2》和DeepSeek的成功,共享着风投式思维模式。传统上经营企业和研发产品的那种渐进、连续且保守的方式早已不再适用,面对复杂且快速变化的环境,企业和个人都应该学习风险投资行业,积极拥抱不确定性,敢于承担风险,不惧失败,勇于创新。风投式思维具有接受不确定性、鼓励创新、将失败视为必需、大胆决策、保持敏捷行动、注重长期发展等核心特征,根据这种思维原则,失败虽有成本,但和一旦成功所获得的巨大收益相比,失败的成本其实非常低。
AI Guide
Tech Industry Updates
New Deep Learning Models Released
Several tech giants have recently launched new deep reasoning models. This surge in model development is indicative of the growing competition in the artificial intelligence sector, where firms are investing heavily in advanced technologies to enhance their offerings.
Exponential Financing and Valuation Increases
In conjunction with these new model releases, companies are experiencing excess financing, leading to a remarkable valuation increase of up to two times. The influx of capital reflects investors' confidence in the potential of AI and its applications across various industries.
Built-in JIT and Clean Implementation
The latest models feature built-in Just-In-Time (JIT) compilation capabilities, showcasing a commitment to efficiency and performance. The implementation is noted for its clean design, making it easier for developers to integrate and utilize these new technologies.
News List
36kr.AI
4 hours ago
36kr.AI
4 hours ago
36kr.AI
4 hours ago
AI Guide
AI
AI Model Applications
The focus is on AI large model applications.
Finance
Biren Technology IPO
Biren Technology is considering a Hong Kong IPO and plans to raise $300 million.
News List
36kr.AI
5 hours ago
36kr.AI
5 hours ago
AI Guide
AI & Machine Learning
Open Source Trends in AI
The trend of AI open source is emerging, with inference models becoming mainstream.
DeepSeek's Impact and Future AI Development
Redpoint Ventures partner Jacob Effron and David Luan discussed DeepSeek's implications for the large model space, suggesting its success indicates a shift towards improving model efficiency. Luan anticipates that large "teacher models" will be trained internally and then compressed for client use. The increasing complexity of AI applications, from simple chat to drug discovery, requires more intelligent models, with reinforcement learning playing a key role in enhancing agent intelligence.
AI Search and Advertising
Controversy arose around AI search tool DeepSeek due to suspected advertising within its answers, showing bias in recommending purchasing channels. This has led to discussions about the possibility of advertising integration in AI search. While technically feasible and seen as valuable by advertisers, user acceptance, content accuracy, and privacy issues remain challenges. Domestic AI search platforms are cautious about incorporating ads.
News List
虎嗅.最新
6 hours ago
广告盯上DeepSeek
近日,AI搜索工具DeepSeek因答案中疑似“夹带”广告引发争议,用户发现其在推荐购买渠道时存在倾向性,甚至有商家借DeepSeek名义推销商品,可能误导用户。尽管腾讯否认了相关指控,但AI搜索中广告植入的可能性引发关注。技术上,AI搜索加广告不难实现,国外已有先例,如Perplexity AI。广告商看好AI搜索的精准性和用户价值,但用户接受度、内容准确性和隐私问题是潜在挑战。目前国内AI搜索平台对广告接入持谨慎态度,用户需自行辨别AI搜索结果中的广告内容。
36kr.AI
6 hours ago
虎嗅.最新
6 hours ago
OpenAI早期员工David Luan最新访谈:DeepSeek并未改变AI技术的叙事
红点创投合伙人Jacob Effron与David Luan探讨了DeepSeek对大模型领域的启示,认为其成功表明AI发展重心已转向提升模型效率。Luan强调,即使模型更高效,对智能的追求也不会停止。他预测,未来大型“教师模型”将在内部实验室训练,再压缩成高效模型供客户使用。Luan还提到,人工智能的应用场景复杂度不断提升,从简单聊天到复杂任务如药物发现,都需要更智能的模型。他认为强化学习在提升Agent智能方面具有重要价值。
36kr.AI
6 hours ago
36kr.AI
6 hours ago
AI Guide
AI Research Assistants
OpenAI Expands Deep Research AI Agent
OpenAI has broadened access to its powerful Deep Research AI agent, now available to ChatGPT Plus, Team, Education, and Enterprise users. This expansion is expected to heighten competition with other players like DeepSeek and Anthropic within the swiftly advancing AI research assistant market.
DeepSeek's Open Source Success and Commercialization Challenges
DeepSeek, having achieved a successful 1.0 phase as an open-source software, has attracted major tech companies such as Tencent, Baidu, and Alibaba. However, it is facing challenges in its commercialization strategy. Despite having substantial funding, the need for a solid business model is crucial for leveraging its technological advancements effectively. This analysis explores three business models for open-source technology:
- Google's approach of “establishing rules” to control core services like GMS.
- MySQL's model of offering an enterprise version alongside a partially open-source product.
- The “open source + product/service” model exemplified by Chrome and RedHat, which may serve as a primary model for DeepSeek: generating revenue through services rather than direct model sales. The focus is on converting user numbers into value through branding and service offerings.
DeepSeek Launches Open Source Week
DeepSeek has kicked off its “Open Source Week,” which highlights the importance of open-source development and its implications. This initiative could hold significant relevance for developers and organizations engaged in open-source projects, fostering collaboration and innovation within the tech community.
News List
虎嗅.最新
7 hours ago
DeepSeek闯进更难的第二关
DeepSeek作为开源软件已实现1.0阶段的成功,吸引腾讯、百度、阿里等大厂接入。但其商业化模式面临挑战,尽管幻方不缺钱,但好的商业模式是技术飞轮的必要条件。文章分析了三种开源技术的商业模式:一是如谷歌安卓通过“建立规则”控制GMS核心服务;二是如MySQL通过部分开源提供企业版本;三是如Chrome和RedHat通过“开源+产品/服务”模式,其中Chrome和RedHat的模式可能成为DeepSeek的主要商业模式:模型不赚钱,用服务赚钱。通过品牌和服务,将用户数转化为价值。
36kr.AI
7 hours ago
VentureBeat
7 hours ago
OpenAI drops Deep Research access to Plus users, heating up AI agent wars with DeepSeek and Claude
OpenAI expands its powerful Deep Research AI agent to ChatGPT Plus, Team, Education, and Enterprise users, intensifying competition with DeepSeek and Anthropic in the rapidly evolving AI research assistant market.
AI Guide
Artificial Intelligence
OpenAI DeepResearch Expansion
OpenAI has broadened access to its DeepResearch AI agent, now available to ChatGPT Plus, Team, Edu, and Enterprise users. Pro users receive 120 deep research queries monthly, while other users get 10. DeepResearch, leveraging a version of the o3 model, conducts multi-step research by collecting web information and synthesizing it into detailed reports. It can search and interpret texts and images, generating reports in 5-30 minutes, complete with citations and summaries of its reasoning, a process OpenAI claims would take humans hours.
News List
ZDNet.AI
10 hours ago
OpenAI’s Deep Research can save you hours of work – and now it’s a lot cheaper to access
OpenAI has expanded access to its DeepResearch AI agent to ChatGPT Plus, Team, Edu, and Enterprise users, with Pro users having 120 deep research queries per month while the other users have 10 queries per month. Powered by a version of the o3 model, DeepResearch conducts multi-step research by gathering information from the web and synthesizing it into a comprehensive report. It can search and interpret texts and images and generate a report in 5-30 minutes, including citations and a summary of its thinking. OpenAI says this would take humans hours.
AI Guide
AI
OpenAI DeepResearch Expansion and Features
- OpenAI is expanding its DeepResearch AI agent, initially for ChatGPT Pro, to Plus, Team, Edu, and Enterprise users. This tool enables ChatGPT to conduct in-depth research and synthesize information into comprehensive reports within 5-30 minutes, citing sources. Plus users receive 10 queries monthly, while Pro users get 120. The tool now includes embedded images with citations and improved document analysis.
OpenAI's Decision to Withhold DeepResearch API
- OpenAI will not release the DeepResearch AI model via its developer API due to concerns about AI's persuasive capabilities and potential for spreading misinformation. The company is revising its methods for assessing “real-world persuasion risks.” Though deemed unsuitable for mass disinformation campaigns due to cost and speed, OpenAI will investigate factors like personalized persuasive content, as tests showed the model could write persuasive arguments, but not better than humans. DeepResearch is powered by a version of the o3 model.
News List
ZDNet.AI
11 hours ago
OpenAI’s Deep Research agent can do in 5 minutes what takes you hours – and now it’s a lot cheaper
OpenAI has launched DeepResearch, an AI agent in ChatGPT that conducts in-depth research and synthesizes information into comprehensive reports. Initially for ChatGPT Pro users, it’s now rolling out to Plus, Team, Edu, and Enterprise users, with varying query limits. Powered by a version of the o3 model, DeepResearch can analyze vast amounts of web content, including texts and images, and generate reports in 5-30 minutes. The reports include citations and summaries. OpenAI is withholding the model from its developer API to assess risks of AI convincing people.
TechCrunch
11 hours ago
Why OpenAI isn’t bringing deep research to its API just yet
OpenAI will not bring the AI model powering DeepResearch to its developer API due to concerns about AI’s ability to persuade and potentially spread misinformation. The company is revising its methods for assessing “real-world persuasion risks.” While OpenAI believes DeepResearch is not suited for mass disinformation campaigns due to its computing costs and speed, it intends to explore factors like personalized persuasive content. Tests showed the DeepResearch model performed well in writing persuasive arguments, but not better than humans.
Engadget
11 hours ago
OpenAI expands Deep Research to all paying ChatGPT users
OpenAI is rolling out its DeepResearch tool to ChatGPT Plus, Team, Edu, and Enterprise users, after initially launching it for Pro users. This feature allows ChatGPT to create in-depth reports on various subjects. Plus users will get 10 DeepResearch queries per month, while Pro subscribers now have 120. OpenAI has also improved the tool by embedding images with citations and enhancing document analysis. Users can access DeepResearch by tapping the icon before sending a request to OpenAI.
AI Guide
AI Development & Applications
AI Model Updates and Competition
Chinese AI startup DeepSeek has reopened its API after a three-week pause due to capacity issues. Their R1 "reasoning" model rivals OpenAI's. Meanwhile, Alibaba launched a preview of its QwQ-Max reasoning AI model, planning to open-source it, indicating growing competition in the Chinese AI market.
AI Tools in Research
The author shares their evolving perspective on AI tools, embracing them for research purposes. They use Ollama with the Llama3.2 LLM for quick answers and DeepSeekR1 with the Mysty GUI for in-depth research, significantly enhancing their research process.
Workplace Dynamics
The Importance of Workplace Relationships
Employees who feel connected and can collaborate are more likely to succeed and be happy. Social connections increase life satisfaction and well-being, while isolation can lead to burnout. Hybrid work models pose challenges to maintaining meaningful workplace relationships, requiring future solutions.
News List
The Verge
16 hours ago
How AI PCs are removing barriers to workplace connection
The importance of relationships in the workplace is highlighted, emphasizing that employees who feel connected and can collaborate are more likely to succeed and be happy. Research indicates that social connections increase life satisfaction and well-being, while isolation can lead to burnout. Employee happiness is linked to better work outcomes and success. The rise of hybrid work models poses challenges to maintaining meaningful workplace relationships, with most businesses planning to continue with remote work options. Many employees struggle to maintain meaningful connections in this new professional landscape, and future solutions need to address this issue.
ZDNet.AI
16 hours ago
I was an AI skeptic until until these 5 tools changed my mind
The author shares their evolving perspective on AI tools, moving from initial skepticism to embracing them for specific purposes like research. They highlight the efficiency of AI in quickly understanding complex concepts, replacing traditional search engines in their workflow. The author primarily uses Ollama, a command-line AI tool, with the Llama3.2 LLM for fast and concise answers. They also use DeepSeekR1 with the Mysty GUI for more in-depth research, particularly when exploring complex topics for creative projects. The combination of these tools has significantly enhanced their research process.
TechCrunch
16 hours ago
DeepSeek reopens access to its API after three-week pause
Chinese AI startup DeepSeek has reopened its API after a three-week halt due to capacity constraints. Customers can now top up credits to use DeepSeek’s AI, but server resources remain strained during daytime. DeepSeek gained prominence with its R1 “reasoning” model, rivaling OpenAI’s models, prompting OpenAI to consider open-sourcing more technology. Meanwhile, Chinese tech giant Alibaba launched a preview of its latest reasoning AI model, QwQ-Max, planning to open-source it, indicating increasing competition in the Chinese AI market.
AI Guide
AI Model Updates
Anthropic's Claude 3.7 Sonnet
Anthropic has launched its first "hybrid model," Claude 3.7 Sonnet, integrating real-time responses with deep-thinking capabilities, allowing users to obtain various types of answers without switching models. The model excels in following instructions, general reasoning, multi-modal capabilities, and autonomous coding, with significant improvements in mathematics and science. Its coding abilities surpass DeepSeek R1 and OpenAI's o1 and o3 models.
Agent Claude Code
Anthropic has also released Agent Claude Code, designed specifically for coding tasks. It can run directly in the terminal, assisting developers with programming tasks.
Claude 3.7 Sonnet Availability and Pricing
Claude 3.7 Sonnet is now fully available, although the extended thinking mode is not accessible for free users, and its pricing is higher than competitors' pure reasoning models.
Anthropic's Funding
Anthropic is reportedly nearing the completion of a new $3.5 billion funding round, potentially valuing the company at $61.5 billion.
News List
虎嗅.最新
17 hours ago
DeepSeek头号黑粉这下爽到了
Anthropic发布首个“混合模型”Claude3.7Sonnet,该模型整合实时应答和深度思考,用户无需切换即可获得不同类型的答案。同时,传闻Anthropic接近完成35亿美元的新一轮融资,估值可能达到615亿美元。Claude3.7Sonnet在遵循指令、一般推理、多模态能力和自主编码方面表现出色,尤其在数学和科学方面有显著提升,代码能力大幅超越DeepSeek R1和OpenAI的o1、o3模型。此外,Anthropic还发布了专注于代码的Agent Claude Code,可以直接在终端运行,帮助开发者完成编程任务。Claude3.7Sonnet已全面上线,但免费用户无法使用扩展思考模式,定价高于竞争对手的纯推理模型。
AI Guide
AI & Technology
AI in Education
DeepSeek is emerging, but ChatGPT maintains its leading position in American universities.
The Impact of AI Companions
The rise of AI girlfriends may affect the birth rate.
Value Reconfiguration
News List
36kr.AI
18 hours ago
36kr.AI
18 hours ago
36kr.AI
19 hours ago
AI Guide
Artificial Intelligence
AI Development & Open Source Initiatives
DeepSeek has open-sourced DeepEP, the world's first full-stack communication library for MoE models, aiming to address AI computing power issues. DeepEP optimizes NVLink technology to increase data transfer speeds between GPUs and utilizes RDMA technology to reduce data transfer latency. It also features intelligent sorting and FP8 data compression to further improve data processing efficiency. DeepSeek also released FlashMLA code during "Open Source Week" to reduce costs in large model training. Through these open-source technologies, DeepSeek is helping to reduce costs for the entire industry chain.
US Restrictions on China's AI
The impact of the White House ban on the rise of Chinese AI is a key consideration.
News List
虎嗅.最新
19 hours ago
DeepSeek扔的第二枚开源王炸到底是什么?
DeepSeek开源了全球首个面向MoE模型的全栈通信库DeepEP,旨在解决AI算力问题。DeepEP通过优化NVLink技术,提升了GPU之间的数据传输速度,并利用RDMA技术降低了数据传输延迟。此外,DeepEP还具备智能分拣功能和FP8数据压缩技术,进一步提高了数据处理效率。DeepSeek还在“开源周”发布了FlashMLA代码,也是为了减少大模型训练过程中的成本。通过开源这些技术,DeepSeek正在帮助产业链上下游降低成本。然而,中国MaaS模式可能面临亏损,因机器成本高昂。
36kr.AI
20 hours ago
AI Guide
Finance
DeepSeek Large Model Application in Banking
Banks are increasingly adopting the DeepSeek large model, but challenges remain, particularly in addressing technical hallucinations.
News List
36kr.AI
21 hours ago
AI Guide
AI and Investment
Anthropic's Valuation Soars
Anthropic's valuation has climbed to $61.5 billion.
Claude 3.7 Sonnet Release
Anthropic has released Claude 3.7 Sonnet, featuring a hybrid reasoning mode to enhance thinking control and computational capabilities.
Reasoning Mode
The new model allows switching between two thinking modes and offers precise control over thinking time.
News List
36kr.AI
22 hours ago
36kr.AI
22 hours ago
中文比 R1 丝滑、玩宝可梦还贼溜?全球首个混合推理模型 Claude 3.7 Sonnet 太惊艳,网友直呼“孤独求败”
Claude 3.7 Sonnet发布,具混合推理模式,增强思考控制和计算能力。
36kr.AI
23 hours ago
AI Guide
AI and Technology Updates
DeepSeek's DeepEP Communication Library
DeepSeek has launched DeepEP, a communication library tailored for Mixture of Experts (MoE) and Expert Parallelism (EP) systems. This library aims to boost the efficiency of large-scale AI training and inference by optimizing GPU kernels, particularly in data routing and output integration within MoE models.
Key features of DeepEP include native support for FP8 intelligent compression transmission and flexible GPU resource management, making it suitable for resource-constrained or real-time applications. By minimizing data transfer wait times and enhancing GPU utilization, DeepEP aims to reduce costs and increase efficiency in areas like natural language processing, code generation, and recommendation systems.
News List
36kr.AI
1 days ago
虎嗅.最新
1 days ago
榨干每一块GPU,DeepSeek开源第二天,送上降本增效神器
DeepSeek发布DeepEP,一个专为混合专家系统(MoE)和专家并行(EP)定制的通信库,旨在提升大规模AI训练和推理的效率。DeepEP通过优化GPU内核,尤其是在MoE模型中的数据路由和输出整合过程,实现了高效的全员协作通道和低延迟内核。其亮点包括原生支持FP8智能压缩传输和灵活调控GPU资源,适用于资源受限或实时性要求高的场景。通过高速通道和无缝换乘,DeepEP显著减少了数据传输的等待时间,提升了GPU的利用率,从而在自然语言处理、代码生成和推荐系统等领域实现降本增效。