通用创意智能 CGICreative General Intelligence (CGI)

探索机器创造力的边界

Expand the Boundaries of Machine Creativity

智能创意工作流 ICWIntelligent Creative Workflow (ICW)

探索人类创造力的边界

Empowering the Boundaries of Human Creativity

创意人工智能研究组 AI4C Research Team

AI for Creativity Research Team AI for Creativity Research Team

创新Innovation 创意Creativity 科学Science 艺术Art

使命与理念 Mission & Philosophy

Mission & Philosophy

我们致力于生成式人工智能与人类创意的深度融合，推动艺术、内容与影视的边界，让创意力量更高效、也更具力量。 We align generative AI with human creativity to expand the frontier of art, content, and cinematic production.

了解更多 Learn More

研究团队 Our Team

Our Team

跨学科、小而精的研究与创作团队，探索未来创意智能的可能性。 A exquisite and interdisciplinary team exploring the future of creative intelligence together.

认识团队 Meet the Team

学术成果 Publications

Publications

围绕AIGC、电影制作与世界模型等方向的持续研究，从问题定义、方法设计到系统落地。 Ongoing work on AIGC, filmmaking, and world model, et.al.

浏览成果 View Publications

未来方向 Future Works

Future Works

探索下一代创意智能体与生产工具，构建开放、可持续的创作生态。 Exploring the next generation of creative agents and production tools.

探索未来 Explore Future

AI × 艺术 AI × Art

AI × Art

影视制作 TV & Filmmaking

TV & Filmmaking

世界模型 World Model

World Model

科学智能 AI for Science

AI for Science

技术是画笔，想象是色彩，合作是画布。我们共同绘制未来的创意图景。 Technology is the brush, imagination the color, and collaboration the canvas.

Technology is the brush, imagination the color, and collaboration the canvas. Together, we paint the future of creativity. Together, we paint the future of creativity.

01 Chapter

PhilosophyPhilosophy

研究核心理念Core Research Vision

秉持“以 AIGC 技术赋能艺术创作与内容生产，开启独立创作时代”的核心理念，本计划以影视级动画作为研究切入点。电影是视听艺术的综合载体，天然融合音乐、画面、色彩、叙事、动效与情感等多重创作要素，为人工智能理解“创作”提供了极具代表性的多模态样本。We anchor our research in film-grade animation because cinema is a unified vessel of audio-visual art — music, image, color, narrative, motion, emotion — which makes it the richest multimodal sample for an AI that tries to understand creation rather than merely imitate it.

电影级标准Cinematic Standard

我们关注的不只是生成质量本身，而是镜头语言、视听协同、叙事节奏、角色一致性与情感传达等真正决定作品完成度的关键维度。Beyond raw generation quality, we care about cinematography, audio-visual coherence, narrative pacing, character consistency, and emotional delivery.
设计思维Design Thinking

让 AI 具备顶层创作设计能力，从被动的素材生成器进化为具备设计思维、能够参与创作决策的创意智能体。We want AI to evolve from a passive generator into a creative agent capable of top-level design decisions across modalities.
工作流落地Workflow Landing

真正进入 2D / 3D 动画的工业化生产流水线，让生成式系统从繁琐工作流走向更纯粹的创意流。Generative systems must enter the real 2D / 3D animation pipeline — turning a tangled workflow into a clean creative flow.

02 Chapter

Research DirectionsResearch Directions

研究方向Research Directions

我们的研究围绕三条主线协同推进：通用创意智能关注机器创造力的边界，智能创意工作流关注人类创造力的释放方式，科学智能关注生命认知的边界。三者并行构成从创意理解、生产应用到底层生命认知科学探索的完整研究闭环。Our research advances synergistically along three main threads: General Creative Intelligence explores the boundaries of machine creativity; Intelligent Creative Workflow focuses on unleashing human creative potential; and AI for Science probes the frontiers of life cognition. Together, these pillars form a complete research loop—spanning from creative understanding and production applications to foundational scientific exploration in life cognition.

通用创意智能 Creative General Intelligence

Creative General Intelligence Creative General Intelligence

探索机器创造力的边界 —— 从“理解人类创作”出发，赋予AI顶层设计与美学推理的能力。 Exploring the boundaries of machine creativity — starting with 'understanding human creation' to endow AI with top-level design and aesthetic reasoning capabilities.

我们旨在打破传统AI“只能机械生成、无法理解创作”的局限。本方向不仅研究人类在艺术创作中的意图、过程与认知机制，更致力于让AI真正具备类似人类导演或设计师的“设计思维 (Design Thinking)”。 1. 跨模态美学对齐：优秀的艺术作品源于对声音、画面与故事的整体想象。我们的研究核心是实现文字、音乐、色彩、情感与动态效果等跨模态设计元素的深度对齐，赋予AI顶层的美学感知和全景式构思能力，使其不仅能生成素材，更能理解创作背后的美学逻辑。 2. 大模型的可解释性与轻量化：从微观（神经元激活表征）、中观（子网络与注意力机制协同）到宏观（整体泛化与涌现能力）三个尺度，深度剖析生成式大模型的内在机理。通过揭示这些“黑盒”规律，指导算法优化与架构精简，实现“以小博大”的高效、轻量化计算。 We aim to break through the traditional limitation of AI being confined to mechanical generation without an understanding of the creative process. This direction not only investigates human intent, processes, and cognitive mechanisms in artistic creation but also strives to grant AI a 'Design Thinking' capability akin to that of human directors or designers. 1. Cross-modal Aesthetic Alignment: Outstanding artworks stem from a holistic imagination of sound, visuals, and narrative. Our core research focuses on the deep alignment of cross-modal design elements—such as text, music, color, emotion, and motion—endowing AI with top-level aesthetic perception and panoramic conceptualization abilities. This enables AI to generate assets while truly understanding the underlying aesthetic logic of creation. 2. Interpretability and Lightweighting of Large Models: We conduct a multi-scale analysis of generative large models—from the micro level (neuron activation patterns), through the meso level (sub-network and attention mechanism synergy), to the macro level (overall generalization and emergent capabilities). By demystifying these 'black boxes,' we guide algorithmic optimization and architectural refinement to achieve efficient, lightweight computing that 'does more with less.'

多模态感知 Multimodal Perception
跨模态对齐 Cross-modal Alignment
创作推理 Creative Reasoning
设计思维 Design Thinking
美学感知 Aesthetic Perception

智能创意工作流 Intelligent Creative Workflow

Intelligent Creative Workflow Intelligent Creative Workflow

探索人类创造力的边界 —— 从“数字资产高效生产”出发，将创意智能体融入工业流水线，解放人类生产力。 Exploring the boundaries of human creativity — starting with 'efficient digital asset production' to integrate creative agents into industrial pipelines and liberate human productivity.

我们专注于将前沿的AI能力转化为高效的生产力工具。通过解决2D/3D动画及影视制作中耗时费力的机械性劳动，让创作者从繁琐的重复性工作中抽离出来，专注于灵感本身。 1. 全维度多模态交互：我们的研究横跨一维音频、二维原画、三维几何资产到四维时空视频。通过打破单一的“图文驱动 (Text/Image-to-X)”局限，拓展更符合艺术家直觉的交互方式（如动作、音频或多模态协同驱动）。 2. 生产应用范式的革命：推动生成式AI与现有成熟工业流流水线的深度融合，解决视觉预览、冷启动3D建模、手工补间帧等具体痛点，实现从繁琐的“工作流 (Workflow)”到纯粹的“创意流 (Creative Flow)”的范式转变。 We focus on translating cutting-edge AI capabilities into efficient productivity tools. By addressing the labor-intensive, mechanical tasks in 2D/3D animation and film production, we enable creators to detach from tedious repetitive work and refocus on inspiration itself. 1. Full-dimensional Multimodal Interaction: Our research spans 1D audio, 2D concept art, 3D geometric assets, and 4D spatiotemporal video. Moving beyond single-modality 'Text/Image-to-X' paradigms, we explore interaction methods that align with artists' intuition, such as motion-driven, audio-driven, or multimodal collaborative generation. 2. Revolutionizing Production Paradigms: We promote the deep integration of generative AI with established industrial pipelines. By solving specific pain points—such as visual previews, cold-start 3D modeling, and manual in-betweening—we facilitate a paradigm shift from cumbersome 'workflows' to pure 'creative flow.'

全维度多模态交互 Full-dimensional Multimodal Interaction
2D/3D/4D 内容 2D/3D/4D Content
数字资产生产 Digital Asset Production
影视动画工作流 Film & Animation Workflow
工业流水线融合 Industrial Pipeline Integration

科学智能 AI for Science

AI for Science / Neuroscience AI for Science / Neuroscience

探索生命认知的边界 —— 借鉴神经生物学与认知科学，开展“类脑研究”与“科学赋能”的双向互哺。 Exploring the boundaries of life cognition — leveraging neurobiology and cognitive science to foster a bidirectional symbiosis between 'brain-inspired research' and 'scientific empowerment.'

我们聚焦于人工智能与神经科学的交叉前沿。通过将计算科学与生物智能深度结合，寻找下一代AI架构的突破口，同时利用AI技术拓展人类对大脑的认知边界。 1. 向内借鉴 (仿真驱动的类脑研究)：引入前沿神经科学理论，将复杂的生物学神经机制转化为可计算的数学模型。通过构建对应大脑不同功能区及脑网络的数学模型与神经网络算法，寻求AI模型架构层面的突破。 2. 向外赋能 (AI驱动的神经科学)：利用先进的生成式AI与数据分析技术，反哺神经科学研究。辅助科学家探索人类大脑复杂的神经连接机制和疾病机理，在脑科学领域发掘并解决具有重大价值的科学难题，并推动临床应用。 We focus on the intersection of artificial intelligence and neuroscience. By deeply integrating computational science with biological intelligence, we seek breakthroughs for next-generation AI architectures while utilizing AI technology to expand the boundaries of human understanding of the brain. 1. Inward Inspiration (Simulation-driven Brain-inspired Research): Introducing advanced neuroscience theories, we translate complex biological neural mechanisms into computable mathematical models. By constructing mathematical models and neural network algorithms corresponding to different functional areas and brain networks, we seek architectural breakthroughs in AI models. 2. Outward Empowerment (AI-driven Neuroscience): Utilizing advanced generative AI and data analytics, we give back to neuroscience research. We assist scientists in exploring the complex neural connectivity mechanisms and disease pathologies of the human brain, aiming to solve significant scientific challenges in brain science and promote clinical applications.

类脑研究 Brain-inspired Research
神经科学 Neuroscience
双向互哺 Bidirectional Symbiosis

03 Chapter

Publications & ProjectsPublications & Projects

代表成果Featured Work

每项成果都试图回答一个具体的问题，而不仅仅是一次实验。下面是我们目前最具代表性的方向锚点。Each piece of work answers a specific question rather than running an isolated experiment. The following are the anchor points of our current research line.

ViStoryBench

面向视觉故事讲述的综合性基准测试。A Comprehensive Benchmark Suite for Story Visualization.

针对定制化视觉故事讲述这一电影生成的核心任务，构建系统性评测框架，衡量模型在角色一致性、风格稳定性、剧情对齐、构图质量与故事可视化方面的真实能力。Targeting customized visual storytelling — the core task of cinematic generation — ViStoryBench defines a systematic evaluation framework for character consistency, style stability, plot alignment, compositional quality, and visualization.

PaperPaper CodeCode DatasetDataset Project PageProject Page

StyleMe3D

3D 高斯下的解耦先验风格化。Stylization with Disentangled Priors on 3D Gaussians.

在 3D 高斯表示下探索可控、可交互、可扩展的风格化生成，通过多编码器与解耦先验实现稳定的风格控制，体现团队对 3D 作为创作媒介的长期判断。Multi-encoder, disentangled-prior stylization for 3D Gaussian representations — exploring controllable, interactive, scalable 3D creation as a long-term creative medium.

PaperPaper Project PageProject Page CodeCode

Awesome-PaperDaily

持续追踪 AIGC 前沿进展的开源知识整理项目。Following the advance of AIGC, day by day.

体现团队在知识组织、趋势追踪与社区共享方面的持续投入，不仅服务于内部研究，也希望与更广泛社区共同推动 AIGC 发展。An open knowledge curation project that tracks AIGC frontiers — reflecting our continued investment in knowledge organization, trend tracking, and community contribution.

CodeCode

04 Chapter

TeamTeam

团队成员Team Members

AIGC Research 由一支关注创意人工智能、生成式模型与跨学科基础研究的小型团队组成，并与工业界、学术界导师保持持续合作，共同推进从基础问题到创作系统落地的全链路研究。AIGC Research is a small team focused on creative AI, generative models, and cross-disciplinary fundamentals. We collaborate with both industry and academic advisors across the full chain — from foundational questions to deployed creative systems.

庄才林Cailin Zhuang

团队成员Team Member

上海科技大学硕士，复旦大学博0M.Sc., ShanghaiTech University; Ph.D. Year 0, Fudan University

研究方向为 AIGC，关注创意人工智能与影视级内容生成。Working on AIGC, creative intelligence, and cinematic content generation.

Personal GitHub Scholar Email

胡耀淇Yaoqi Hu

团队成员Team Member

——

专注于开源社区贡献（Hugging Face）、企业级数据工作流开发，并在多媒体与视觉生成领域具有深厚的研究积累。Focuses on open-source community contributions on Hugging Face, enterprise data workflow development, and multimedia and visual generation research.

Personal GitHub Email

董政Zheng Dong

团队成员Team Member

——

简介待补充。Bio coming soon.

程巍Wei Cheng

工业界导师Industry Advisor

阶跃星辰 · 研究科学家StepFun · Research Scientist

研究方向为生成式人工智能。Research on generative AI.

Scholar

李梦甜Mengtian Li

学术界导师Academic Advisor

上海大学 · 上海电影学院Shanghai University · Shanghai Film Academy

研究方向为AI电影。Research on AI filmmaking.

Personal Scholar

夏清玲Qingling Xia

学术界导师Academic Advisor

重庆理工大学Chongqing University of Technology

研究方向为生物医学工程。Research on biomedical engineering.

Scholar

朱思语Siyu Zhu

学术界导师Academic Advisor

复旦大学Fudan University

研究方向为视频世界模型。Research on video world models.

Personal Scholar

05 Chapter

Get in TouchGet in Touch

合作与联系Collaborate with Us

如果您对我们的研究方向、项目合作、学术交流、开源共建或创意生产系统落地感兴趣，欢迎与我们取得联系。让我们共同塑造一个全新的创意世界。If our research directions, open-source efforts, academic exchanges, or creative production systems resonate with you, we warmly invite you to connect. Let's shape a novel creative world together.

邮件联系Email Us journeyzhuang@gmail.com

微信公众号WeChat

扫码关注 AIGC ResearchScan to follow AIGC Research

Channels

创意人工智能研究组 AI4C Research Team

使命与理念 Mission & Philosophy

研究团队 Our Team

学术成果 Publications

未来方向 Future Works

电影级标准Cinematic Standard

设计思维Design Thinking

工作流落地Workflow Landing

ViStoryBench

StyleMe3D

Awesome-PaperDaily

庄才林Cailin Zhuang

胡耀淇Yaoqi Hu

董政Zheng Dong

程巍Wei Cheng

李梦甜Mengtian Li

夏清玲Qingling Xia

朱思语Siyu Zhu