用人工智能With Artificial Intelligence

拓展人类的创造边界

Expand the Boundaries of Human Creativity with AI

让动画电影与电视制作

因 AI 而更自由

Empowering Animation & TV Production for Everyone

创意人工智能研究组 AI for Creativity Research Team

AI for Creativity Research Team Creative AI for art, cinema, and future production

创新Innovation 艺术Art 科学Science 协作Collaboration

AI × 艺术 AI × Art AI × 艺术

AI × Art AI × Art

动画电影 Animation & Film 动画电影

Animation & Film Animation & Film

电视制作民主化 TV Production Democratization 电视制作民主化

TV Production Democratization TV Production Democratization

开放协作研究 Open & Collaborative Research 开放协作研究

Open & Collaborative Research Open & Collaborative Research

技术是画笔,想象是色彩,合作是画布。我们共同绘制未来的创意图景。 Technology is the brush, imagination the color, and collaboration the canvas.

Technology is the brush, imagination the color, and collaboration the canvas. Together, we paint the future of creativity. Together, we paint the future of creativity.

01 Chapter

PhilosophyPhilosophy

研究核心理念Core Research Vision

秉持“以 AIGC 技术赋能艺术创作与内容生产,开启独立创作时代”的核心理念,本计划以影视级动画作为研究切入点。电影是视听艺术的综合载体,天然融合音乐、画面、色彩、叙事、动效与情感等多重创作要素,为人工智能理解“创作”提供了极具代表性的多模态样本。We anchor our research in film-grade animation because cinema is a unified vessel of audio-visual art — music, image, color, narrative, motion, emotion — which makes it the richest multimodal sample for an AI that tries to understand creation rather than merely imitate it.

  • 电影级标准Cinematic Standard

    我们关注的不只是生成质量本身,而是镜头语言、视听协同、叙事节奏、角色一致性与情感传达等真正决定作品完成度的关键维度。Beyond raw generation quality, we care about cinematography, audio-visual coherence, narrative pacing, character consistency, and emotional delivery.

  • 设计思维Design Thinking

    让 AI 具备顶层创作设计能力,从被动的素材生成器进化为具备设计思维、能够参与创作决策的创意智能体。We want AI to evolve from a passive generator into a creative agent capable of top-level design decisions across modalities.

  • 工作流落地Workflow Landing

    真正进入 2D / 3D 动画的工业化生产流水线,让生成式系统从繁琐工作流走向更纯粹的创意流。Generative systems must enter the real 2D / 3D animation pipeline — turning a tangled workflow into a clean creative flow.


02 Chapter

Research DirectionsResearch Directions

研究方向Two Research Pathways

我们的研究围绕两条主线协同推进:通用创意智能关注机器创造力的边界,智能创意工作流关注人类创造力的释放方式,二者并行构成从创意理解到创意生产的完整研究闭环。Our research advances along two pathways. General Creative Intelligence explores the frontier of machine creativity; Intelligent Creative Workflow focuses on releasing human creativity. Together they form a closed loop from creative understanding to creative production.

01

通用创意智能General Creative Intelligence

General Creative Intelligence通用创意智能

从“素材生成器”走向“具有设计思维的创意智能体”。From asset generator to design-thinking creative agent.

我们关注 AI 如何获得在创作中进行顶层设计的能力。导演构思叙事作品时,本质上是在脑海中同时组织声音、画面、动作、节奏与故事逻辑,进行跨模态的整体想象。本方向重点研究多模态感知、跨模态语义对齐、设计元素建模、美学认知与创作推理,使模型能够理解声音、画面、叙事与情绪之间的复杂关系。A film director, when conceiving a narrative, orchestrates sound, image, action, pacing, and story logic simultaneously — a cross-modal act of imagination. This pathway studies how AI can acquire that top-level design capability through multimodal perception, cross-modal alignment, aesthetic cognition, and creative reasoning.

  • 多模态感知Multimodal Perception
  • 跨模态对齐Cross-modal Alignment
  • 创作推理Creative Reasoning
  • 美学认知Aesthetic Cognition
  • 设计思维智能体Design-Thinking Agent
02

智能创意工作流Intelligent Creative Workflow

Intelligent Creative Workflow智能创意工作流

让工具适应创作,而不是让艺术家适应工具。Let the tools adapt to creators — not the other way around.

我们关注如何将创意智能体真正引入内容生产流程,减少 2D / 3D 动画制作中视觉预览、冷启动建模、补间帧、多镜头衔接等耗时环节的重复劳动。覆盖从 1D 音频、2D 图像、3D 几何资产到 4D 视频的全数据形态,打破单一图文驱动范式,构建更贴近影视动画创作直觉的多模态交互机制。We bring creative agents into real production, reducing mechanical steps in 2D / 3D animation: previsualization, cold-start 3D modeling, inbetweening, shot transitions, asset recomposition. We cover the full data spectrum — 1D audio, 2D images, 3D geometry, 4D video — and break the single text-to-image paradigm with multimodal interaction grounded in artist intuition.

  • 智能体落地Agent Deployment
  • 2D/3D/4D 内容2D/3D/4D Content
  • 多模态交互Multimodal Interaction
  • 影视动画工作流Film & Animation Workflow
  • Workflow → Creative FlowWorkflow → Creative Flow

03 Chapter

Publications & ProjectsPublications & Projects

代表成果Featured Work

每项成果都试图回答一个具体的问题,而不仅仅是一次实验。下面是我们目前最具代表性的方向锚点。Each piece of work answers a specific question rather than running an isolated experiment. The following are the anchor points of our current research line.

ViStoryBench

面向视觉故事讲述的综合性基准测试。A comprehensive benchmark suite for story visualization.

针对定制化视觉故事讲述这一电影生成的核心任务,构建系统性评测框架,衡量模型在角色一致性、风格稳定性、剧情对齐、构图质量与故事可视化方面的真实能力。Targeting customized visual storytelling — the core task of cinematic generation — ViStoryBench defines a systematic evaluation framework for character consistency, style stability, plot alignment, compositional quality, and visualization.

StyleMe3D

3D 高斯下的解耦先验风格化。Stylization with disentangled priors on 3D Gaussians.

在 3D 高斯表示下探索可控、可交互、可扩展的风格化生成,通过多编码器与解耦先验实现稳定的风格控制,体现团队对 3D 作为创作媒介的长期判断。Multi-encoder, disentangled-prior stylization for 3D Gaussian representations — exploring controllable, interactive, scalable 3D creation as a long-term creative medium.

ShotStudio

面向完整影视创作流程的 AIGC 框架。An AIGC framework for end-to-end film creation.

将创意理解、镜头组织、素材生成与创作流程协同串联起来,是面向未来电影生成与影视创作系统的重要框架原型,也是不断演化的创意工作流实验场。An evolving framework that integrates creative understanding, shot organization, asset generation, and production coordination — a pioneering scaffold for future cinematic generation systems.

Act2Cut

多镜头视频叙事的动作连续性建模。Continuous multi-shot video narrative — match on action.

聚焦电影语言中“动作匹配剪辑”这一长期被忽视的关键问题,让生成式视频系统具备跨镜头动作连续性理解,强调叙事动作与电影语法之间的协同。Tackling the long-overlooked “match on action” problem in cinematic language, Act2Cut enables generative video systems to understand action continuity across shots — narrative motion, not just surface frame similarity.


04 Chapter

TeamTeam

团队成员Team Members

AIGC Research 由一支关注创意人工智能、生成式模型与跨学科基础研究的小型团队组成,并与工业界、学术界导师保持持续合作,共同推进从基础问题到创作系统落地的全链路研究。AIGC Research is a small team focused on creative AI, generative models, and cross-disciplinary fundamentals. We collaborate with both industry and academic advisors across the full chain — from foundational questions to deployed creative systems.

庄才林Cailin Zhuang

团队成员Team Member

上海科技大学硕士M.Sc., ShanghaiTech University

研究方向为 AIGC,关注创意人工智能与影视级内容生成。Working on AIGC, creative intelligence, and cinematic content generation.

胡耀淇Yaoqi Hu

团队成员Team Member

简介待补充。Bio coming soon.

董政Zheng Dong

团队成员Team Member

简介待补充。Bio coming soon.

程巍Wei Cheng

工业界导师Industry Advisor

阶跃星辰 · 研究科学家StepFun · Research Scientist

研究方向为生成式人工智能。Researches generative AI.

夏清零Qingling Xia

学术界导师Academic Advisor

重庆理工大学Chongqing University of Technology

研究方向为生物医学工程。Researches biomedical engineering.


05 Chapter

Get in TouchGet in Touch

合作与联系Collaborate with Us

如果您对我们的研究方向、项目合作、学术交流、开源共建或创意生产系统落地感兴趣,欢迎与我们取得联系。让我们共同塑造一个全新的创意世界。If our research directions, open-source efforts, academic exchanges, or creative production systems resonate with you, we warmly invite you to connect. Let's shape a novel creative world together.

微信公众号WeChat

微信公众号

扫码关注 AIGC ResearchScan to follow AIGC Research