Evaluation Flow overview
Evaluation flows are special types of flows that assess how well the outputs of a run align with specific criteria and goals by calculating metrics.
In prompt flow, you can customize or create your own evaluation flow and metrics tailored to your tasks and objectives, and then use it to evaluate other flows
-
Go to AI Studio AI Studio.
-
Once inside your project, select Evaluation from the left dropdown menu.
-
From your Evaluation view, select New evaluation in the middle of the page.
-
From here you can create, name a new evaluation and select your scenario.
-
Select the flow you want to evaluate. (To evaluate the DraftFlow select DraftFlow here)
-
Select metrics you would like to use. Also, be sure to select an active Connection and active Deployment name/Model.
-
Use an existing dataset or upload a dataset to use in evaluation. (Upload the provided dataset found in \Deployment\data\EvaluationDataset.csv)
Once the flow has been ran successfully, the metrics will be displayed showing a 1-5 score of each respective metric. From here, you can click into the evaluation flow to get a better understanding of the scores.