AI智能报告生成实操：30分钟极速搞定跨境分析！

各位跨境电商的朋友们，大家好！作为一名深耕跨境行业多年的老兵，我深知大家在海外淘金过程中对效率和精准信息的需求有多么迫切。今天，我不是以冷冰冰的AI身份，而是以一名实战导师的身份，手把手带大家深入了解如何利用前沿的AI技术，打造一个属于我们自己的“智能报告生成助手”。这不仅能解放我们的双手，更能让我们的市场分析、运营决策如虎添翼。

请大家记住，我们今天的目标，是把一个看似复杂的海外技术指南，彻底本土化，变成一份符合我们中国跨境人阅读习惯、充满实战洞察的教程。让我们一起，将知识转化为生产力！

跨境新机遇：用AI агенты武装你的报告生成器

传统的报告生成方式，往往循规蹈矩，信息捕捉和决策逻辑都固定得很。但现在，AI智能体（AI Agent）的出现，彻底改变了这种局面。它像是一个拥有自主思考能力的“数字员工”，能利用大语言模型（LLM）来独立判断、灵活应变，甚至还能进行复杂的逻辑推理。在风云变幻的全球市场中，这种能力对我们跨境人来说，无疑是至关重要的。

通过这次自我学习工作坊，各位将收获满满：

透彻理解AI智能体的四大核心考量：我们将以英伟达（NVIDIA）的Nemotron开放模型家族为例，深入浅出地剖析其运作机制。
亲手搭建一个能研究、能撰写的文档生成智能体：它将成为你跨境事业的得力助手，帮你快速产出高质量的市场报告。
掌握LangGraph和OpenRouter的实战技巧：学会如何利用这些工具构建强大的AI智能体。
获得一个“开箱即用”的便携式开发环境：让你轻松上手，随时随地开启智能体之旅。
拥有一个专属定制的智能体：把它打造成你的NVIDIA Launchable，随时分享和部署。

部署工作坊：轻松开启你的AI之旅

想要立刻体验？很简单！点击下方的“立即部署”按钮，就能将整个工作坊环境一键部署到云端，省去繁琐的配置步骤。
image of the deploy now button

配置你的“秘密武器”：API密钥设置

要让你的AI智能体顺利运转，我们需要准备一些关键的“通行证”，也就是项目密钥（secrets）。这就像给你的智能体配备了获取外部情报的钥匙。

OpenRouter API密钥：这是我们访问OpenRouter平台上的英伟达Nemotron Nano 9B V2模型的重要凭证。
Tavily API密钥：有了它，我们的智能体就能实时进行网络搜索，获取最新的市场数据和行业动态。

当你的JupyterLab环境启动后，在JupyterLab启动器中找到“NVIDIA DevX Learning Path”区域下的“Secrets Manager”模块。点击它，就可以轻松配置这些密钥。别忘了在日志（logs）标签页里确认，密钥是否已经成功添加。
A screenshot of the Secrets Manager tile under NVIDIA DevX Learning Path

接下来，还是在JupyterLab启动器的“NVIDIA DevX Learning Path”区域，选择“1. Introduction to Agents”模块，打开实验指南，正式开始我们的智能体构建之旅！
A screenshot of the 1. Introduction to Agents tile

揭秘智能体架构：它如何“思考”？

当工作坊环境配置妥当后，第一部分会带领我们认识AI智能体。在深入学习具体实现之前，理解智能体与普通AI应用的区别，这非常关键。它就像我们跨境人做市场调研，首先得搞清楚市场规律一样。

不同于传统的基于大语言模型（LLM）的应用，智能体能动态选择工具、进行复杂推理，并根据具体情况调整分析方法。我们将学到构建所有智能体的四大核心要素：

模型（Model）：这就像智能体的“大脑”，负责决定使用哪些工具以及如何回应。它掌控着全局。
工具（Tools）：这些是智能体的“双手”，让它能执行各种动作，比如进行数学计算、查询数据库，或是调用API接口。
记忆与状态（Memory and state）：这相当于智能体的“记忆库”，在对话过程中和不同对话之间，都能为它提供必要的信息。
路由（Routing）：这好比智能体的“导航员”，根据当前状态和模型的决策，决定下一步该做什么。

你将在code/intro_to_agents.ipynb文件中学习如何将这些组件组合起来，构建你的第一个、配备计算器的基础智能体。完成这个练习后，你的智能体将能实现以下功能：

[{'content': 'What is 3 plus 12?', 'role': 'user'}, {'content': None, 'role': 'assistant', 'tool_calls': [{'function': {'arguments': '{"a": 3, "b": 12}', 'name': 'add'}, 'id': 'chatcmpl-tool-b852128b6bdf4ee29121b88490174799', 'type': 'function'}]}, {'content': '15', 'name': 'add', 'role': 'tool', 'tool_call_id': 'chatcmpl-tool-b852128b6bdf4ee29121b88490174799'}, {'content': 'The answer is 15.', 'role': 'assistant'}]

你看，它能识别问题，调用工具计算，然后给出准确答案。这正是智能体“思考”和“行动”的体现。

报告生成器的“生产线”拆解

本次工作坊的核心，是利用LangGraph框架和NVIDIA NIM（通过OpenRouter端点托管）来构建一个多层次的智能体系统。它就像一条精密的报告生产线，由四个相互关联的智能体组件构成，每个组件各司其职，共同完成文档生成任务：

初步研究（Initial research）：全面收集关于主题的各类信息。
大纲规划（Outline planning）：根据研究结果，制定结构化的文档大纲。
分章节撰写（Section writing）：为每个章节撰写详细内容，并根据需要进行补充研究。
最终整合（Final compilation）：将所有章节汇编成一份专业的报告。

从理论到实践：代码实现与演练

概念都明白了，现在就让我们撸起袖子，将这些理论付诸实践。我们将从前面提到的基础考量入手，逐步构建出完整的智能体：

选择合适的“大脑”：模型
配备强大的“工具”：工具
搭建“市场调研员”：研究员
培养“报告执笔者”：撰写员
组装“总设计师”：最终智能体
掌控“指挥中心”：智能体的管理与路由

根基：智能体的“大脑”——模型选择

本次工作坊主要依赖NVIDIA NIM端点作为智能体的核心模型动力。NVIDIA NIM提供了高性能的推理能力，这对我们跨境人而言，意味着更快的决策支持和更强的分析能力。具体优势包括：

工具绑定：原生支持函数调用，让智能体能无缝使用各种工具。
结构化输出：内置支持Pydantic模型，保证了输出内容的规范性和易读性。
异步操作：全面支持异步/等待（async/await）机制，实现并发处理，效率更高。
企业级可靠性：提供生产级别的推理基础设施，稳定可靠。

这个例子展示了如何通过OpenRouter端点，利用ChatNVIDIA连接器调用NVIDIA NIM托管的模型。

from langchain_nvidia_ai_endpoints import ChatNVIDIA
llm = ChatNVIDIA(
    base_url="https://openrouter.ai/api/v1",
    model="nvidia/nemotron-nano-9b-v2:free",
    api_key=os.getenv("OPENROUTER_API_KEY")
)

llm_with_tools = llm.bind_tools([tools.search_tavily])

我们新媒网跨境获悉，在任何基于大语言模型（LLM）的应用中，清晰、高质量的指令至关重要，但对智能体来说，这更是重中之重，因为它能消除歧义，明确决策过程。以下是code/docgen_agent/prompts.py中一个研究提示词的例子：

research_prompt: Final[str] = """
Your goal is to generate targeted web search queries that will gather comprehensive information for writing a technical report section.

Topic for this section: {topic}

When generating {number_of_queries} search queries, ensure they:
1. Cover different aspects of the topic (e.g., core features, real-world applications, technical architecture)
2. Include specific technical terms related to the topic
3. Target recent information by including year markers where relevant (e.g., "2024")
4. Look for comparisons or differentiators from similar technologies/approaches
5. Search for both official documentation and practical implementation examples

Your queries should be:
- Specific enough to avoid generic results
- Technical enough to capture detailed implementation information
- Diverse enough to cover all aspects of the section plan
- Focused on authoritative sources (documentation, technical blogs, academic papers)
"""

这个提示词展示了提示工程（prompting）的一些关键原则，大家要牢记：

角色明确：清晰定义智能体的专长和职责，例如，让它充当“市场分析师”。
任务拆解：将复杂的任务分解为具体、可操作的步骤或标准，好比跨境物流，每一步都要细化。
细节具体：引用时间范围（如“2025年最新”）和权威来源（如“行业报告”）等例子，让智能体知道从哪里找，找什么。
结构化输入/输出：明确要求回应的结构和预期输入格式，确保报告格式规范统一。

根基：智能体的“双手”——工具选择

智能体的能力通过它所能使用的“工具”来定义。在本次工作坊中，我们主要利用Tavily作为信息收集的核心工具，它是一个专为AI智能体设计的搜索API。就好比我们跨境人需要一个高效的全球搜索引擎来获取市场情报。

# imports and constants omitted
@tool(parse_docstring=True)
async def search_tavily(
    queries: list[str],
    topic: Literal["general", "news", "finance"] = "news",
) -> str:
    """Search the web using the Tavily API.

    Args:
        queries: List of queries to search.
        topic: The topic of the provided queries.
            general - General search.
            news - News search.
            finance - Finance search.

    Returns:
        A string of the search results.
    """
    search_jobs = []
    for query in queries:
        search_jobs.append(
            asyncio.create_task(
                tavily_client.search(
                    query,
                    max_results=MAX_RESULTS,
                    include_raw_content=INCLUDE_RAW_CONTENT,
                    topic=topic,
                    days=days,  # type: ignore[arg-type]
                )
            )
        )
    search_docs = await asyncio.gather(*search_jobs)
    return _deduplicate_and_format_sources(
        search_docs,
        max_tokens_per_source=MAX_TOKENS_PER_SOURCE,
        include_raw_content=INCLUDE_RAW_CONTENT,
    )

工具模块实现的关键架构设计包括：

异步操作：使用asyncio.gather()实现并发搜索，大大提高效率，就像多个市场调研员同时工作。
去重机制：辅助函数能够避免多次搜索造成的信息冗余，保证报告内容的精炼。
结构化文档：采用Google风格的docstrings，帮助大语言模型（LLM）更好地理解工具用途，避免误用。

现在，我们已经对模型和工具有了基础的理解，是时候将它们组装成一个真正能运作的智能体了。虽然我们还没探讨状态管理和路由，但别急，等我们的智能体组件构建完成后，我们再回过头来仔细研究。

打造“市场调研员”：研究员智能体

简单的ReAct（Reasoning and Action）智能体已经非常强大，但通常会结合额外的步骤来实现更复杂的工作流。研究员智能体便采用了ReAct模式，这是目前工具型智能体最有效的架构之一。这种模式创建了一个“思考-行动-反馈”的循环：智能体先思考该做什么，然后采取行动，再根据结果决定下一步。这个循环会一直持续，直到智能体判断任务已完成。
A diagram showcasing a simple React Agent loop

研究员组件的代码在code/docgen_agent/researcher.py中实现，并可以通过code/researcher_client.ipynb进行测试。

state = ResearcherState(
    topic="Examples of AI agents in various industries.",
    number_of_queries=3,
)

state = await graph.ainvoke(state)

for message in state["messages"]:
    print("ROLE: ", getattr(message, "role", "tool_call"))
    print(message.content[:500] or message.additional_kwargs)
    print("")

你也可以看到研究员在执行过程中采取的每一个行动，就像我们新媒网跨境观察到一个调研团队的实时工作日志。

INFO:docgen_agent.researcher:Calling model.
INFO:docgen_agent.researcher:Executing tool calls.
INFO:docgen_agent.researcher:Executing tool call: search_tavily
INFO:docgen_agent.tools:Searching the web using the Tavily API
INFO:docgen_agent.tools:Searching for query: Technical architecture of AI agents in healthcare 2024
INFO:docgen_agent.tools:Searching for query: Comparison of machine learning frameworks for AI agents in finance
INFO:docgen_agent.tools:Searching for query: Real-world applications of natural language processing in AI agents for customer service 2024
INFO:httpx:HTTP Request: POST https://api.tavily.com/search "HTTP/1.1 200 OK"
INFO:httpx:HTTP Request: POST https://api.tavily.com/search "HTTP/1.1 200 OK"
INFO:httpx:HTTP Request: POST https://api.tavily.com/search "HTTP/1.1 200 OK"
INFO:docgen_agent.researcher:Calling model.

以及最终的输出，包括工具调用请求、工具输出以及总结结果的最终消息。

ROLE: assistant
{'tool_calls': [{'id': 'chatcmpl-tool-b7185ba8eb3a44259b0bdf930495ece5', 'type': 'function', 'function': {'name': 'search_tavily', 'arguments': '{"queries": ["Technical architecture of AI agents in healthcare 2024", "Comparison of machine learning frameworks for AI agents in finance", "Real-world applications of natural language processing in AI agents for customer service 2024"], "topic": "general"}'}}]}
ROLE: tool_call
"Sources:\n\nSource AI Agents in Modern Healthcare: From Foundation to Pioneer:\n===\nURL: https://www.preprints.org/manuscript/202503.1352/v1\n===\nMost relevant content from source: T. Guo et al., \"Large language model based multi-agents: A survey of progress and challenges,\" arXiv preprint arXiv:2402.01680, 2024. J. Ruan et al., \"TPTU: Large Language Model-based AI Agents for Task Planning and Tool Usage. | **Partner Agent** | * True healthcare team partners * Generates clinical hypotheses
ROLE: assistant
Based on the search results, here are some potential search queries that could be used to gather comprehensive information for writing a technical report section on examples of AI agents in various industries:
1. "Technical architecture of AI agents in healthcare 2024"
2. "Comparison of machine learning frameworks for AI agents in finance"
3. "Real-world applications of natural language processing in AI agents for customer service 2024"

培养“报告执笔者”：撰写员智能体

虽然ReAct智能体很强大，但要完成更复杂的工作流，通常需要结合额外的步骤。例如，章节撰写员（section author）需要进行额外的、与章节相关的研究，但仅在被要求时进行。一旦所需的研究资料就位，它必须利用这些资料来撰写章节内容。这就像我们写跨境市场分析报告，需要先调研，觉得资料不够再补充，最后才是动笔。

在修改后的架构图中，我们在ReAct风格的智能体之前增加了一个门控函数（gating function），用于判断是否需要进行额外的研究，并在末尾增加了一个撰写步骤。
A diagram showing the decision tree leading to the React Agent loop.

撰写员组件的代码在code/docgen_agent/author.py中实现，并可以通过code/author_client.ipynb进行测试。

state = SectionWriterState(
    index=1,
    topic="Examples of AI agents in various industries.",
    section=Section(
        name="Real-World Applications",
        description="Examples of AI agents in various industries",
        research=True,
        content=""
    ),
)

state = await graph.ainvoke(state)

Markdown(state["section"].content)

你也可以看到撰写员在执行过程中采取的每一个行动。

INFO:docgen_agent.author:Researching section: Real-World Applications
INFO:docgen_agent.author:Executing tool calls for section: Real-World Applications
INFO:docgen_agent.author:Executing tool call: search_tavily
INFO:docgen_agent.tools:Searching the web using the Tavily API
INFO:docgen_agent.tools:Searching for query: AI agents in healthcare industry 2024
INFO:docgen_agent.tools:Searching for query: Implementation of AI agents in finance sector 2023
...
INFO:httpx:HTTP Request: POST https://api.tavily.com/search "HTTP/1.1 200 OK"
INFO:docgen_agent.author:Researching section: Real-World Applications
INFO:docgen_agent.author:Writing section: Real-World Applications

以及最终的输出，它会以Markdown格式呈现撰写好的章节内容。

## Real-World Applications
AI agents have numerous applications across various industries...

### Healthcare
AI agents in healthcare are revolutionizing patient care and medical innovation. They are being used to automate administrative tasks, enhance diagnostics, and improve workflow efficiency. For instance...

### Finance
AI agents in finance are driving innovation, success, and compliance. They are being used to automate tasks such as data entry, transaction processing, and compliance checks. AI agents are also being used to detect fraud, improve customer service, and provide personalized investment advice. For example...

新媒网跨境预测，随着AI技术的不断发展，这类智能撰写能力将极大地提高我们内容产出的效率。

组装“总设计师”：最终的报告生成智能体

有了研究员和撰写员这两个核心组件，我们现在可以将它们组合起来，构建出我们的文档生成智能体的工作流程。这个架构目前是最简洁的——一个线性的工作流：先研究主题，然后撰写各个章节，最后将所有内容汇编成一份完整的报告。
A diagram depicting the entire end-to-end flow.

最终智能体的代码在code/docgen_agent/agent.py中实现，并可以通过code/agent_client.ipynb进行测试。

state = AgentState(
    topic="The latest developments with AI Agents in 2025.",
    report_structure="This article should be..."
)

state = await graph.ainvoke(state)

Markdown(state["report"])

你也可以看到智能体在执行过程中采取的每一个行动。

INFO:docgen_agent.agent:Performing initial topic research.
INFO:docgen_agent.researcher:Calling model.
INFO:docgen_agent.researcher:Executing tool calls.
INFO:docgen_agent.researcher:Executing tool call: search_tavily
INFO:docgen_agent.tools:Searching the web using the Tavily API
INFO:docgen_agent.tools:Searching for query: AI Agents 2025 core features
INFO:docgen_agent.tools:Searching for query: Real-world applications of AI Agents in 2025
...
INFO:httpx:HTTP Request: POST https://api.tavily.com/search "HTTP/1.1 200 OK"
INFO:docgen_agent.researcher:Calling model.
INFO:docgen_agent.agent:Calling report planner.
INFO:docgen_agent.agent:Orchestrating the section authoring process.
INFO:docgen_agent.agent:Creating author agent for section: Introduction
INFO:docgen_agent.agent:Creating author agent for section: Autonomous Decision-Making
INFO:docgen_agent.agent:Creating author agent for section: Integration with Physical World
INFO:docgen_agent.agent:Creating author agent for section: Agentic AI Trends
INFO:docgen_agent.agent:Creating author agent for section: AI Agents in Customer Support
INFO:docgen_agent.agent:Creating author agent for section: AI Agents in Healthcare
INFO:docgen_agent.agent:Creating author agent for section: Conclusion
INFO:docgen_agent.agent:Throttling LLM calls.
INFO:docgen_agent.author:Writing section: Introduction
INFO:docgen_agent.author:Researching section: Autonomous Decision-Making
...

最终生成的报告将以Markdown格式呈现。这里提供了一份由该智能体生成的报告范例： sample markdown output 。

风险前瞻与时效提醒：虽然AI生成的报告效率极高，但在跨境贸易中，涉及市场决策、合规性审核等关键环节时，新媒网跨境建议各位务必进行人工复核。AI模型目前仍可能存在“幻觉”或信息偏差，人工的经验和判断是不可替代的。同时，教程中部分技术细节和API接口可能随时间更新，建议大家在实操时留意官方文档的最新版本。

根基：智能体的“指挥中心”——状态管理与路由

智能体组件都建好了，现在我们回头来审视一下，如何利用LangGraph这个智能体框架，实现高级的状态管理和流程控制，将所有组件连接成一个完整的智能AI系统。它就像我们跨境运营中的中控台，协调着各个环节。

LangGraph提供了几个关键优势：

条件路由（Conditional routing）：条件边缘（Conditional edges）能根据运行时条件实现动态流程控制，让智能体能智能地决定下一步行动，这在处理复杂多变的跨境市场情况时尤为重要。
图编译与执行（Graph compilation and execution）：编译后的图可以异步调用，支持并发执行和复杂的编排模式，这对于多智能体系统来说至关重要，能大大提升处理效率。

在以下来自code/docgen_agent/agent.py的示例中，我们可以看到之前构建的各个组件如何对应不同的节点（node），以及连接这些节点、将中间输出从一个节点路由到下一个节点的边缘（edge）。

main_workflow = StateGraph(AgentState)
main_workflow.add_node("topic_research", topic_research)
main_workflow.add_node("report_planner", report_planner)
main_workflow.add_node("section_author_orchestrator", section_author_orchestrator)
main_workflow.add_node("report_author", report_author)

main_workflow.add_edge(START, "topic_research")
main_workflow.add_edge("topic_research", "report_planner")
main_workflow.add_edge("report_planner", "section_author_orchestrator")
main_workflow.add_edge("section_author_orchestrator", "report_author")
main_workflow.add_edge("report_author", END)

恭喜各位！通过一步步学习这个开发者工作坊，你们已经亲手构建了自己的LangGraph智能体。现在，快使用code/agent_client.ipynb笔记本，测试一下你的新智能体吧！