Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to content

Latest commit

 

History

History
27 lines (26 loc) · 589 Bytes

api-management-llm-semantic-cache-example.md

File metadata and controls

27 lines (26 loc) · 589 Bytes
author ms.service ms.custom ms.topic ms.date ms.author
dlepow
azure-api-management
build-2024
include
08/21/2024
danlep
<policies>
    <inbound>
        <base />
        <llm-semantic-cache-lookup
            score-threshold="0.05"
            embeddings-backend-id ="llm-backend"
            embeddings-backend-auth ="system-assigned" >
            <vary-by>@(context.Subscription.Id)</vary-by>
        </llm-semantic-cache-lookup>
    </inbound>
    <outbound>
        <llm-semantic-cache-store duration="60" />
        <base />
    </outbound>
</policies>