以下是关于 Prompt 工程的详细介绍:
Prompt 工程是一门经验科学,涉及迭代和测试提示以优化性能。在提示工程周期中,大部分精力并非用于编写提示,而是花在开发一组强大的评估上,然后针对这些评估进行测试和迭代。
Claude 在开箱即用时提供了高水平的基线性能,但提示工程可以帮助进一步提升其性能,并对其响应进行微调,以更好地适应特定的用例。这些技术对于实现 Claude 的良好结果不是必要的,但可能对提升输入和输出有用。若要快速开始使用提示或了解提示作为一个概念,可以参阅提示入门。
提示开发生命周期包括:
[title]2.Prompt engineering 提示工程[heading1]What is prompt engineering? 什么是提示工程?[content]Prompt engineering is an empirical science that involves iterating and testing prompts to optimize performance.Most of the effort spent in the prompt engineering cycle is not actually in writing prompts.Rather,the majority of prompt engineering time is spent developing a strong set of evaluations,followed by testing and iterating against those evals.提示工程是一门经验科学,涉及迭代和测试提示以优化性能。在提示工程周期中花费的大部分精力实际上并不是在编写提示上。相反,大部分提示工程时间都花在开发一组强大的评估上,然后针对这些评估进行测试和迭代。
Claude offers high-level baseline performance out of the box.However,prompt engineering can help you enhance its performance further and fine-tune its responses to better suit your specific use case.These techniques are not necessary for achieving good results with Claude,but you may find them useful in upleveling your inputs & outputs.Claude在开箱即用时提供了高水平的基线性能。然而,及时的工程可以帮助您进一步提升其性能,并对其响应进行微调,以更好地适应您特定的用例。这些技术对于实现Claude的良好结果并不是必要的,但您可能会发现它们对提升输入和输出非常有用。To quickly get up and running with a prompt or get introduced to prompting as a concept,see[intro to prompting](https://docs.anthropic.com/claude/docs/intro-to-prompting).要快速开始使用提示或了解提示作为一个概念,可以参阅提示入门。
[title]2.Prompt engineering 提示工程[heading1]What is prompt engineering? 什么是提示工程?[heading2]The prompt development lifecycle 提示开发生命周期[content]3.Engineer the preliminary prompt:Next,craft an initial prompt that outlines the task definition,characteristics of a good response,and any necessary context for Claude.Ideally you should add some examples of canonical inputs and outputs for Claude to follow.This preliminary prompt will serve as the starting point for refinement.设计初步提示:接下来,制定一个初步的提示,概述任务定义、良好响应的特征,以及Claude所需的任何上下文。理想情况下,您应该添加一些规范输入和输出的示例供Claude参考。这个初步提示将作为改进的起点。4.Test prompt against test cases:Feed your test cases into Claude using the preliminary prompt.Carefully evaluate the model's responses against your expected outputs and success criteria.Use a consistent grading rubric,whether it's human evaluation,comparison to an answer key,or even another instance of Claude’s judgement based on a rubric.The key is to have a systematic way to assess performance.根据测试用例测试提示:使用初步提示将测试用例输入到Claude中。仔细评估模型的响应与您预期的输出和成功标准是否一致。使用一致的评分标准,无论是人工评估、与答案标准的比较,甚至是基于评分标准的Claude判断的另一个实例。关键是要有一种系统性的评估性能的方式。