让每一次创作沉淀为 AI 资产,加速专业影视创作。
一个 Agent,串起整支团队的影视工作流。
三项核心能力:业内领先的影像检索、覆盖全影视流程的 AI 工作流,以及可随团队持续进化的个性化 Agent。
背靠 CMU 大模型研究团队,专注专家数据、Agent 评估与数据飞轮驱动的后训练。多项顶会研究成果已被 Google DeepMind、字节跳动、xAI 等 100+ AI 实验室采用。
Turn every project into an AI asset. Accelerate professional film creation.
One agent that runs your studio's whole workflow.
Three core capabilities: industry-leading visual retrieval, an AI workflow spanning the full film pipeline, and a personalized agent shaped by your team.
Built by a CMU foundation-model research team, working on expert data curation, agent evaluation, and data-flywheel post-training. Top-conference research adopted by 100+ AI labs including Google DeepMind, ByteDance, and xAI.
头部工作室已经开始用 AI 重构工作流,最先跑通的团队,未来一两年将迅速拉开差距。 Top studios are already rebuilding their workflows around AI. The teams that get this right first will pull ahead within one to two years.
有甲方、有 deadline、有预算,创意对齐失败带来的是真金白银的返工成本。
他们愿意为真正解决问题的工具付费,也最能分辨工具是否专业。
整个影视行业都在重构 AI 工作流,但市面上仍缺少真正面向专业流程的产品。
需求早已存在,只差一个对的工具。
They have clients, deadlines, and budgets. When creative alignment fails, it costs real money in rework.
They'll pay for tools that genuinely solve the problem, and they can tell professional tools from amateur ones.
The film industry is redesigning workflows around AI, but no product on the market is truly built for professional pipelines.
The demand has been there for a while. What's missing is the right tool.
专业影视团队有三件事一直没被现有工具解决——这也正是 Moodio 的三个核心壁垒所对应的痛点: Three problems for professional film teams have never been properly solved by existing tools. They map directly to Moodio's three pillars:
现有工具都是给个人创作者做的,Moodio 能更好地服务专业影视团队。 Every existing tool was built for individual creators. Moodio serves professional film teams better.
导演脑子里有什么镜头,嘴上说出来就行。检索效果同时超过 Google Gemini Embedding 2 和阿里 Qwen-3-VL Embedding。Whatever shot the director has in mind, just describe it. Retrieval outperforms both Google Gemini Embedding 2 and Alibaba Qwen-3-VL Embedding.
找参考是专业团队每天都在做的事。一条对的参考,静帧也好、视频也好,团队和客户立刻就能对上方向,沟通成本也跟着降下来。在我们公开发布的影视检索基准 Moodio-T2V 上,Moodio 的检索效果同时超过 Google Gemini Embedding 2 和阿里 Qwen-3-VL Embedding。Finding references is what professional teams do every day. The right one, still or video, instantly aligns the team and the client, and communication cost drops on the spot. On our publicly released cinematic retrieval benchmark Moodio-T2V, Moodio outperforms both Google Gemini Embedding 2 and Alibaba's Qwen-3-VL Embedding.
举几个例子:"焦点从前景转到背景的变焦"、"从昏暗室内推到明亮室外"——这种带镜头变化的专业描述,只有 Moodio 能搜到。A few examples: "A rack focus from a foreground subject to the background street," "a push from a dim interior out into bright exterior." Only Moodio can retrieve professional descriptions like these.






一个 Agent,把整支团队的影视工作流全串起来。从创意到成片,全程在一块画布上。One agent that runs your studio's whole workflow. From ideation to production, all on one canvas.
全流程视频创作。End-to-end video creation. Moodio Agent 已经把 1. 搜参考、2. 写脚本、3. 出分镜表、4. 生成画面、5. 生成视频,到 6. 剪辑导出整条流程跑通了。从一个 idea 到一支可交付的成片,不用切换六七个工具,全程在一块画布上。Moodio Agent runs the full pipeline: 1. find references, 2. write the script, 3. build the shot list, 4. generate frames, 5. generate video, 6. edit and export. From an idea to a deliverable cut, all on one canvas, no tool-switching.
多人协作,少开会。Real-time collaboration, fewer meetings. 一个真实项目里,导演、摄影、美术、视频团队和甲方往往要为一个镜头反复沟通。Moodio 让所有人站在同一块画布上看同一份方案。任何人都能用一句自然语言调动 Agent:On a real project, the director, DP, art lead, video team, and client end up going back and forth on every shot. Moodio puts everyone on the same canvas looking at the same brief. Anyone can call the agent in plain language:
一句话讲清楚要做什么,Agent 就去做。甲方反馈也从"感觉不对"变成"这个方向换一下",原本要开三个会才能对齐的事,画布上就解决了。Say what you want in one sentence and the agent does it. Client feedback shifts from "something feels off" to "change this direction," and what used to take three meetings now happens on the canvas.
资产沉淀。Assets accumulate. 每一次搜参考、每一次方案讨论、每一次分镜调整,整个项目的创作过程都被结构化保存进 Agent 的记忆层。后期修改直接追溯到源头,下一个同类项目可以复用整套创作框架。Every reference search, every direction discussion, every storyboard tweak is structured and saved into the agent's memory layer. Later revisions trace back to the source, and the next similar project reuses the whole creative framework.
团队在 Moodio 上每做一个项目,都在为自己的专属 Agent 积累训练数据。项目越多,Agent 越懂这支团队。Every project a team ships on Moodio is training data for the team's own dedicated agent. More projects, sharper agent.
用户数据飞轮。User data flywheel. 每一次拒绝、修改、最终选定,都在告诉 Agent 这支团队的审美在哪里、底线在哪里。我们用这些数据持续做后训练,让 Agent 内化团队的视觉判断。项目越多,Agent 越懂这支团队;Agent 越懂团队,团队越愿意把更多项目放上来。Every rejection, revision, and final pick teaches the agent this team's taste and shows it where the bar is. We use that data for ongoing post-training, so the agent truly internalizes the team's visual judgment. The more projects, the sharper the agent. The sharper the agent, the more projects come in.
用户模拟器 · 团队的视觉助理分身。User Simulator · The team's visual-assistant twin. 数据攒够之后,Moodio 会为每支团队后训练出一个用户模拟器:一个能模仿团队思维方式的 Agent 分身。一部短剧拍到第十集,分身已经看过你们怎么取景、怎么打光、怎么剪每一个镜头。第十一集,它可以按你们的习惯先把初稿做出来。这时候生成已经不是抽卡,而是分身真懂这支团队怎么干活。新人加入工作室,分身直接把团队的审美和品质底线教给他,原本要几周磨合的事,几天就够了。Once enough data has accumulated, Moodio post-trains a user simulator for each team: an agent twin that mirrors how the team thinks. By the time a short series hits episode ten, the twin has watched how you frame, light, and cut every shot. On episode eleven, it can put together a first cut the way you would. Generation stops being a slot machine. It becomes the work of a twin that actually knows how this team works. When a new hire joins the studio, the twin instills the team's taste and standards from day one. What used to take weeks now takes a few days.
市面上想做的人不少,但没有一家把三件事同时做对:电影级检索、有记忆的 Agent、专业团队协作。 Plenty of teams are trying. None has gotten all three right at once: cinematic retrieval, an agent with memory, professional team collaboration.
目前最接近的有两类产品。TapNow 更偏自媒体创作者工具,画布和检索能力难以支撑大型影视项目,Agent 也缺少长期记忆。即梦 Octo 则更偏消费级创作,流程完整,但定位仍是面向泛创作者,而非专业影视团队。 The two closest competitors today are TapNow and Dreamina Octo. TapNow leans toward short-form creator tooling: its canvas and retrieval can't support large film projects, and the agent lacks long-term memory. Dreamina Octo is more of a consumer creative tool, full-pipeline but positioned for general creators rather than professional film teams.
它们都没有真正解决专业影视工作流的问题,因为从第一天起,就不是为这群人设计的。 Neither truly solves the professional film workflow, because from day one, neither was designed for this audience.
Moodio 不一样。我们从研究到产品,都围绕专业影视创作展开:每一项研究对应真实 production workflow,每一个产品功能都与行业团队共同打磨。谁在做、为谁做,决定了最后能做出什么。 Moodio is different. From research to product, everything we build centers on professional film creation: every paper maps to a real production workflow, and every feature is honed alongside industry teams. Who builds it, and for whom, determines what gets built.
横跨中美的团队,兼具 AI 研究、产品工程与专业影视制作能力。A team across the U.S. and China spanning AI research, product engineering, and professional filmmaking.

















我们的研究在视频理解和电影级生成评估这两个方向上已经是业内标杆。CVPR、NeurIPS、ECCV 顶会都有论文落地,Google DeepMind、字节跳动、xAI、可灵、Midjourney 等前沿实验室都在用,Hugging Face 累计下载超过 200 万。Our research has become a benchmark in video understanding and cinematic generation evaluation. Published at CVPR, NeurIPS, and ECCV. Adopted by frontier labs at Google DeepMind, ByteDance, xAI, Kling, Midjourney, and others. 2M+ cumulative downloads on Hugging Face.
让模型看懂任意视频里的运镜。已被 Google DeepMind、xAI、可灵等前沿视频实验室采用。Understanding camera motion in any video. Adopted by Google DeepMind, xAI, Kling, and frontier video labs.
人和 AI 协作标注视频。Moodio 的标注质量和检索精度就是从这套方法来的。Building precise video language with human-AI oversight. The framework behind Moodio's caption quality and retrieval precision.
文生图/视频的当前最强评估指标。Google DeepMind 把它列为 CLIPScore 最强替代方案。State-of-the-art metric for evaluating text-to-visual generation. Google DeepMind named it the strongest replacement for CLIPScore.
组合式文生图/视频评测基准。Google Imagen-4 官方技术报告独家引用。Benchmark for compositional text-to-visual generation. Uniquely adopted in the official Google Imagen-4 technical report.
第一次将视觉参考引入视频生成工作流。检索效果同时超过 Google Gemini Embedding 2 和阿里 Qwen-3-VL Embedding。The first to bring visual reference into the video generation workflow. Retrieval outperforms both Google Gemini Embedding 2 and Alibaba Qwen-3-VL Embedding.
第一个跑完整套视频制作流程的 Agent,从真实用户使用里学奖励。专业制作的数据飞轮。First end-to-end video production agent with reward learning from real user interactions. The data flywheel behind studio-grade workflow.
品牌宣传片、电影节入围短片、小红书爆款内容,均已真实交付上线。 Brand films, festival selections, viral content. All shipped, all real.

Moodio 在中美两地能触达 200+ 位专业创作者,覆盖制片、美术、AI 导演、摄影、配乐等环节。常驻合作的创作者包括: Moodio reaches 200+ professional creators across China and the US, spanning production, art direction, AI direction, cinematography, and music. Regular collaborators include:
点击姓名查看完整经历。 Click a name to see full credits.
Agent 不仅能成为专业团队的专属分身,也能将顶尖创作者的经验开放给更多人。The agent can be a dedicated twin for any professional team, and a way to share top-tier creative expertise with everyone else.
专业团队在 Moodio 上做的项目越多,平台积累的创意智能就越厚。等到积累够厚,Moodio 后训练出来的 Agent 可以手把手带着没有影视背景的人做出专业级的可交付影像:从找参考、定风格、出分镜,到生成画面、剪辑导出,Agent 全程辅助决策。Moodio 的终点,是整个创意行业的智能基础设施。The more projects professional teams ship on Moodio, the more the platform's creative intelligence compounds. Once it's deep enough, the agent Moodio post-trains can walk someone with no film background through a professional, deliverable piece of work: finding references, setting style, building shot lists, generating frames, and final editing. The agent guides every decision. Moodio's endpoint is the intelligent infrastructure the entire creative industry runs on.
产品在内测阶段,年经常性收入已经做到 $500K,月环比 100% 增长,仍未进行公开市场推广。这一轮融资做两件事:把已经跑通的产品扩到更多工作室;把下一阶段的研究做透,让每家工作室的创作历史,变成他们自己的 Agent。Product in private beta. $500K ARR, 100% month-over-month growth, no public-facing marketing yet. This raise does two things: take what's already working to more studios, and finish the next research push, turning each studio's creative history into a creative twin that's truly their own.
把产品送到本来就该用它的工作室手上。中国和北美两个市场的销售、合作、内容投放都跟上。Reach the studios this is built for. Sales, partnerships, and content across China and North America.
把用户模拟器先做出来,让 Agent 真正理解每支团队的创作习惯,生成从靠抽卡变成靠理解。Ship the user simulator. Make the Agent truly understand each team's creative habits. Generation stops being a slot machine and becomes informed work.
中美两地继续招研究和工程人才。把电影级检索的优势再拉大,把画布协作和资产沉淀做扎实。Hire research and engineering across the U.S. and China. Widen the cinematic retrieval lead. Make canvas collaboration and asset accumulation rock-solid.
我们做的不只是一个工具。把专业团队沉淀下来的创意智能开放给所有人,以前只有顶级团队能做的内容,大众创作者也能做出来。Moodio 的终点是整个创意行业的智能基础设施。We're not just building a tool. The creative intelligence professional teams accumulate here will become accessible to everyone. Content that used to be the domain of top studios will be within reach of any creator. Moodio's endpoint is the intelligent infrastructure the entire creative industry runs on.
$500K ARR,月环比翻倍,仍未进行公开市场推广。本轮融资意向 $2–5M。$500K ARR. Doubling month over month. No public-facing marketing yet. Round size $2–5M.
投资意向交流 · Investor inquiries · zhiqiulin98@gmail.com