Enhancing Document Extraction with Azure AI Document Intelligence and LangChain for RAG Workflows
18/07/2024
Introduction
One way to optimize the cost and performance of Large Language Models (LLMs) is to cache their responses; this is sometimes referred to as “semantic caching”. In this blog, we will discuss the approaches, benefits, common scenarios, and key considerations for using semantic caching. What…
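To make the idea concrete, here is a minimal Python sketch of semantic caching under stated assumptions: the `embed`, `call_llm`, and `SemanticCache` names, the toy hash-based embedding, and the 0.9 similarity threshold are all illustrative placeholders rather than anything prescribed in this blog. The point is only the pattern: embed the prompt, look for a sufficiently similar cached prompt, and skip the LLM call on a hit.

```python
# Minimal semantic-caching sketch. embed() and call_llm() are placeholders;
# in a real system they would wrap an embedding model and an LLM API.
from typing import List, Optional, Tuple
import numpy as np


def embed(text: str) -> np.ndarray:
    """Placeholder embedding: deterministic per text within a run, 8 dimensions."""
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    return rng.standard_normal(8)


def call_llm(prompt: str) -> str:
    """Placeholder LLM call: replace with a real model invocation."""
    return f"response to: {prompt}"


def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))


class SemanticCache:
    """Cache LLM responses keyed by prompt embeddings rather than exact strings."""

    def __init__(self, similarity_threshold: float = 0.9):
        self.similarity_threshold = similarity_threshold
        self.entries: List[Tuple[np.ndarray, str]] = []  # (embedding, response)

    def lookup(self, prompt: str) -> Optional[str]:
        query = embed(prompt)
        for cached_embedding, cached_response in self.entries:
            if cosine_similarity(query, cached_embedding) >= self.similarity_threshold:
                return cached_response  # a semantically similar prompt was seen before
        return None

    def add(self, prompt: str, response: str) -> None:
        self.entries.append((embed(prompt), response))


def answer(cache: SemanticCache, prompt: str) -> str:
    cached = cache.lookup(prompt)
    if cached is not None:
        return cached            # cache hit: no LLM call, lower cost and latency
    response = call_llm(prompt)  # cache miss: call the model and store the result
    cache.add(prompt, response)
    return response
```

In practice, the linear scan over cached embeddings would be replaced by a vector store or cache service, and the similarity threshold would be tuned per workload to balance hit rate against the risk of returning a stale or mismatched response.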