Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
SlideShare a Scribd company logo
Iguane Solutions ©2024 - ig1.com
2024.07.02 - Plug n Play Gen AI Platform
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
A New Era
2
IG1
Iguane Solutions ©2024 - all rights reserved - ig1.com
Index
Plug n Play Gen AI Platform Event
3
PLUG N PLAY
GEN AI PLATFORM
CUSTOMER
TESTIMONIAL
QUESTIONS
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
1. Plug n Play Gen AI Platform
4
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
5
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Plug n Play Gen AI Platform
6
Context: Artificial Intelligence Era
PLUG N PLAY
GEN AI PLATFORM
Market Size
1
source: https://www.precedenceresearch.com/artificial-intelligence-market
$ 1 807 bn
$ 638bn
2024 2030
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Plug n Play Gen AI Platform
AI Platform Concept
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
What is an AI Platform ?
7
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Not everyone can move
to the public cloud or use OpenAI .
1. Plug n Play Gen AI Platform
8
Context: Artificial Intelligence Era
PLUG N PLAY
GEN AI PLATFORM
Context
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
1. Hardware & Cloud :
Infrastructure
2. Model Foundation: LLM & RAG usage
3. Integration, Orchestration &
Deployment tooling
4. Gen AI Applications
9
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 01: Hardware & Cloud
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
Unpacking and Initial Setup
Unpack the Hardware
Carefully unpack the servers, NVidia GPUs, and other hardware components.
Purpose: Ensures that all components are intact and ready for installation.
Rack the Servers
Install the servers into the designated racks in the data center.
Purpose: Provides a secure and organized physical setup.
Connect Power and Networking
Connect the servers to power sources and the data center network.
Purpose: Ensures the servers are powered and networked for subsequent configuration.
Hardware Configuration
Install NVidia GPUs
Physically install the NVidia GPUs into the servers according to the manufacturer's instructions.
Purpose: Provides the necessary hardware acceleration for AI computations.
Verify Hardware Connections
Ensure all connections are secure and components are properly seated.
Purpose: Prevents hardware failures and connectivity issues during operation.
Physical Servers
10
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 01: Hardware & Cloud
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
Operating System Installation
Install the OS
Install the OS: Install IG1 AI OS, a specially designed operating system tailored for AI services, leveraging
our deep expertise and capability in managing "plug and play" platforms for AI.
Purpose: Provides the underlying operating system for all software and services.
Update the System
Run system updates to ensure all packages are up to date.
Purpose: Ensures the system has the latest security patches and features.
GPU Drivers and CUDA Installation
NVidia Drivers
Install the latest NVidia drivers for the GPUs.
Purpose: Enables the operating system to communicate with the GPUs.
CUDA Toolkit
"CUDA toolkit" is embedded in IG1 OS.
Purpose: Provides the necessary libraries and tools for developing and running GPU-accelerated applications.
Base System
11
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 01: Hardware & Cloud
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
KUBE by IG1 for AI
Installation and Configuration
Install KUBE by IG1
Follow the installation guide for KUBE by IG1 to set up the virtualization layer.
Purpose: Provides a platform for managing virtual machines and containers.
Configure Networking
Set up networking within KUBE to ensure communication between nodes and external access.
Purpose: Ensures seamless communication and data transfer within the cluster and with external clients.
Cluster Installation
Initialize KUBE Cluster
Initialize the KUBE cluster to create a control plane and add worker nodes.
Purpose: Establishes the core infrastructure for managing containerized applications.
Verify Cluster Health
Check the health and status of the KUBE cluster to ensure all components are functioning correctly.
Purpose: Identifies and resolves any issues before proceeding with further setup.
12
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
LLM Model Setup
LLM Model Setup
Download LLM
Obtain the LLM from the appropriate source.
Purpose: Provides the base AI model for various applications.
LLM Optimization
Optimization consists in optimising resource usage by preparing and enhancing LLMs through a process
called quantization. Quantization increases inference performance without significantly compromising
accuracy. Our quantization management services utilize the AWQ project, which provides excellent
performance in terms of speed and accuracy.
LLMs Inference servers
Similar to database engines, LLMs inference servers run LLMs for inference or embedding. IG1 installs
and manages all the necessary services for the proper functioning of LLM models. For this, we rely on
several instances of:
- VLLM , ideal for models without quantization FP16,
- Nvidia Triton Inference server , for optimized models with Nvidia TensorRT-LLM
- TGI (Text Generation Inference) for Hugging Face models
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 02: Model Foundation
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
13
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
RAG Setup
RAG (Retrieval-Augmented Generation) Setup
Integrate RAG Components
Set up the necessary RAG components (example using the LlamaIndex framework):
- Retriever: Finds the most relevant information from the data.
- Generator: Uses the retrieved information to generate accurate responses.
- Embedding: Transforms data into vector representations to improve retrieval accuracy.
- Reranking: Organizes and prioritizes the retrieved results based on relevance.
Purpose: Enhances the LLM with retrieval-augmented capabilities for more accurate and relevant responses.
Deploy RAG Pipeline
Deploy the RAG pipeline within the KUBE environment.
Purpose: Ensures the RAG system is operational and integrated with the LLM model.
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 02: Model Foundation
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
14
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Integration of AI Services
Integrate various AI services seamlessly to ensure efficient communication and operation.
This includes:
API Integrations
Connect your AI models to various APIs for extended functionalities, including data retrieval, processing,
and user interface interactions.
Data Pipelines
Establish data pipelines to ensure smooth data flow between different components, facilitating real-time
data processing and analysis.
The API Core acts as a Proxy LLM , balancing the load between LLMs inference server instances .
LiteLLM , deployed in High Availability, is used for this purpose. It offers wide support for LLM servers,
robustness, and usage information and API key storage through PostgreSQL . LiteLLM also enables
synchronization between different instances and sends LLM usage information to our
observability tools .
Integration AI
Services
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 03: Integration, Orchestration & Deployment Tooling
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
15
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Observability &
Traceability
Observability and Traceability
Implement observability tools to gain insights into the behavior and performance of your AI applications:
Centralized Logging
Aggregate logs from different services and applications in a central location for easier analysis and
troubleshooting
Metrics Collection
Collect metrics on various aspects of your applications' performance, such as response times, error rates,
and resource usage.
Distributed Tracing
Use distributed tracing to track requests as they flow through different services, helping to identify
bottlenecks and optimize performance..
The LLMs observability layer collects usage data and execution traces, ensuring proper LLM
management . IG1 efficiently manages LLM usage through a monitoring stack connected to the
LLM orchestrator . Lago and OpenMeter collect information , which is then transmitted to our
central observability system, Sismology .
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 03: Integration, Orchestration & Deployment Tooling
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
16
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Chat
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 04: AI Applications
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
17
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Dev Copilot
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 04: AI Applications
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
18
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
1. Plug n Play Gen AI Platform
Context: Artificial Intelligence Era
PLUG N PLAY
GEN AI PLATFORM
Context
92% of US-based developers have already
used an AI assistant for coding 2
92%
2
https://github.blog/2023-06-13-survey-reveals-ais-impact-on-the-developer-experience/
19
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Low Code LLM tool
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 04: AI Applications
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
20
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
API Setup
. . .
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure / Layer 04: AI Applications
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
21
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Physical Servers Base System KUBE by IG1 for AI
LLM Model Setup RAG Setup
Integration AI
Services
Observability &
Traceability
Chat Dev Copilot Low Code LLM tool API Setup
. . .
Plug n Play Gen AI Platform
Layered Architecture for Gen AI Infrastructure
PLUG N PLAY
GEN AI PLATFORM
Layered Architecture
Concept: Gen AI
22
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
23
Artificial Intelligence Era
PLUG N PLAY
GEN AI PLATFORM
Our Mission
Our mission
Help organizations to
benefit from AI
by providing them AI platforms
Plug n Play Gen AI Platform
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Service
Benefit from our expert-led AI services that deliver
tailored solutions, from infrastructure design to
ongoing support, ensuring seamless integration
and immediate usability . With advanced security,
data integrity oversight, and personalized interfaces,
our services enhance your operations with
integrated AI tools for efficient DevOps, MLOps, and
AIOps, supporting scalable and effective AI
management.
Hosting
Elevate your AI applications with our cutting-edge
hardware and cloud infrastructure, featuring
NVIDIA GPU-equipped servers, optimized
Linux-based IG1 AI OS, and KUBE by IG1 for
efficient virtual machine and container management.
Our comprehensive solutions cover everything from
initial server setup to seamless deployment,
ensuring exceptional performance and flexibility for
your AI workloads.
Software
Enhance your AI projects with our all-inclusive
software solutions designed for deploying and
managing Large Language Models (LLMs) and
Retrieval-Augmented Generation (RAG)
systems . Our packages offer powerful tools for API
integration, data pipeline management,
containerized deployment, and comprehensive
observability, ensuring smooth operations and
insightful performance metrics for optimal resource
management.
Pricing
Our Offers , starting July 2024
+ +
24
PLUG N PLAY
GEN AI PLATFORM
Our Offers
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Full RAG: Embedding,
Reranking resources
and Vector DB
LLMs inference servers for multi LLMs
Slack Support 24 / 7
Control your data,
hosted in France
KUBE by IG1 for AI
Train, Fine-tune et run your own Models
Metrology & Supervision
by Sismology
Ollama to
OpenAI API translator
H100 & H200 GPUs
AIOps, MLOps
Consulting
Support
25
Plug n Play Platforms for Gen AI
Models quantization
for LLMs optimization
OpenAI API-compatible core
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
2. Customer Testimonials
26
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Customer Testimonials: Easybourse
27
Data Security and GDPR Compliance
CUSTOMER
TESTIMONIAL
Context Highly regulated
banking sector
Absolute data
security
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Customer Testimonials: Easybourse
28
Expertise and innovation at Iguana Solutions
CUSTOMER
TESTIMONIAL
Research & Development
● Recognized expertise on highly innovative topics
● Benefit from one year of R&D
● Numerous tests to evaluate what works
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Customer Testimonials: Easybourse
29
Need for internal resources
CUSTOMER
TESTIMONIAL
HR
● No internal resources dedicated to R&D on GenAI
● Difficulty progressing alone on the subject
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Customer Testimonials: Easybourse
30
Deployed Gen AI Platform
CUSTOMER
TESTIMONIAL
Easybourse
Gen AI Platform
1. Hardware & Cloud Infrastructures
2. LLM & RAG Deployments
3. Integration, Orchestration, and Deployment Tools
4. AI Applications
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Customer Testimonials: Easybourse
31
Utilization of Code Copilot
CUSTOMER
TESTIMONIAL
Use Cases
● Immediate adoption of Copilot for coding assistance
● Ease of deployment: Plug n Play in less than an hour
● Proven efficiency from the first uses
● Unfailing and high-quality support and guidance
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Customer Testimonials: Easybourse
32
Deployment and Next Steps
CUSTOMER
TESTIMONIAL
Use Cases
● Initial deployment of GenAI for operations
● Current testing of the IG1 platform with RAG
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Customer Testimonials: Easybourse
33
Key Benefits for Easybourse
CUSTOMER
TESTIMONIAL
Key Benefits
● Guaranteed data security, no DPO discussions
● Access to results from over a year of R&D
● True Plug n Play: operational in no time
● Very high-quality professional service
Iguane Solutions ©2024 - all rights reserved - ig1.com
IG1
Questions
34
Service
Hosting Software
+ +
Iguane Solutions ©2024 - all rights reserved - ig1.com
36

More Related Content

20240702 Présentation Plateforme GenAI.pdf

  • 1. Iguane Solutions ©2024 - ig1.com 2024.07.02 - Plug n Play Gen AI Platform
  • 2. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 A New Era 2
  • 3. IG1 Iguane Solutions ©2024 - all rights reserved - ig1.com Index Plug n Play Gen AI Platform Event 3 PLUG N PLAY GEN AI PLATFORM CUSTOMER TESTIMONIAL QUESTIONS
  • 4. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 1. Plug n Play Gen AI Platform 4
  • 5. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 5
  • 6. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Plug n Play Gen AI Platform 6 Context: Artificial Intelligence Era PLUG N PLAY GEN AI PLATFORM Market Size 1 source: https://www.precedenceresearch.com/artificial-intelligence-market $ 1 807 bn $ 638bn 2024 2030
  • 7. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Plug n Play Gen AI Platform AI Platform Concept PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI What is an AI Platform ? 7
  • 8. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Not everyone can move to the public cloud or use OpenAI . 1. Plug n Play Gen AI Platform 8 Context: Artificial Intelligence Era PLUG N PLAY GEN AI PLATFORM Context
  • 9. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 1. Hardware & Cloud : Infrastructure 2. Model Foundation: LLM & RAG usage 3. Integration, Orchestration & Deployment tooling 4. Gen AI Applications 9
  • 10. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 01: Hardware & Cloud PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI Unpacking and Initial Setup Unpack the Hardware Carefully unpack the servers, NVidia GPUs, and other hardware components. Purpose: Ensures that all components are intact and ready for installation. Rack the Servers Install the servers into the designated racks in the data center. Purpose: Provides a secure and organized physical setup. Connect Power and Networking Connect the servers to power sources and the data center network. Purpose: Ensures the servers are powered and networked for subsequent configuration. Hardware Configuration Install NVidia GPUs Physically install the NVidia GPUs into the servers according to the manufacturer's instructions. Purpose: Provides the necessary hardware acceleration for AI computations. Verify Hardware Connections Ensure all connections are secure and components are properly seated. Purpose: Prevents hardware failures and connectivity issues during operation. Physical Servers 10
  • 11. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 01: Hardware & Cloud PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI Operating System Installation Install the OS Install the OS: Install IG1 AI OS, a specially designed operating system tailored for AI services, leveraging our deep expertise and capability in managing "plug and play" platforms for AI. Purpose: Provides the underlying operating system for all software and services. Update the System Run system updates to ensure all packages are up to date. Purpose: Ensures the system has the latest security patches and features. GPU Drivers and CUDA Installation NVidia Drivers Install the latest NVidia drivers for the GPUs. Purpose: Enables the operating system to communicate with the GPUs. CUDA Toolkit "CUDA toolkit" is embedded in IG1 OS. Purpose: Provides the necessary libraries and tools for developing and running GPU-accelerated applications. Base System 11
  • 12. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 01: Hardware & Cloud PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI KUBE by IG1 for AI Installation and Configuration Install KUBE by IG1 Follow the installation guide for KUBE by IG1 to set up the virtualization layer. Purpose: Provides a platform for managing virtual machines and containers. Configure Networking Set up networking within KUBE to ensure communication between nodes and external access. Purpose: Ensures seamless communication and data transfer within the cluster and with external clients. Cluster Installation Initialize KUBE Cluster Initialize the KUBE cluster to create a control plane and add worker nodes. Purpose: Establishes the core infrastructure for managing containerized applications. Verify Cluster Health Check the health and status of the KUBE cluster to ensure all components are functioning correctly. Purpose: Identifies and resolves any issues before proceeding with further setup. 12
  • 13. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 LLM Model Setup LLM Model Setup Download LLM Obtain the LLM from the appropriate source. Purpose: Provides the base AI model for various applications. LLM Optimization Optimization consists in optimising resource usage by preparing and enhancing LLMs through a process called quantization. Quantization increases inference performance without significantly compromising accuracy. Our quantization management services utilize the AWQ project, which provides excellent performance in terms of speed and accuracy. LLMs Inference servers Similar to database engines, LLMs inference servers run LLMs for inference or embedding. IG1 installs and manages all the necessary services for the proper functioning of LLM models. For this, we rely on several instances of: - VLLM , ideal for models without quantization FP16, - Nvidia Triton Inference server , for optimized models with Nvidia TensorRT-LLM - TGI (Text Generation Inference) for Hugging Face models Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 02: Model Foundation PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 13
  • 14. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 RAG Setup RAG (Retrieval-Augmented Generation) Setup Integrate RAG Components Set up the necessary RAG components (example using the LlamaIndex framework): - Retriever: Finds the most relevant information from the data. - Generator: Uses the retrieved information to generate accurate responses. - Embedding: Transforms data into vector representations to improve retrieval accuracy. - Reranking: Organizes and prioritizes the retrieved results based on relevance. Purpose: Enhances the LLM with retrieval-augmented capabilities for more accurate and relevant responses. Deploy RAG Pipeline Deploy the RAG pipeline within the KUBE environment. Purpose: Ensures the RAG system is operational and integrated with the LLM model. Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 02: Model Foundation PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 14
  • 15. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Integration of AI Services Integrate various AI services seamlessly to ensure efficient communication and operation. This includes: API Integrations Connect your AI models to various APIs for extended functionalities, including data retrieval, processing, and user interface interactions. Data Pipelines Establish data pipelines to ensure smooth data flow between different components, facilitating real-time data processing and analysis. The API Core acts as a Proxy LLM , balancing the load between LLMs inference server instances . LiteLLM , deployed in High Availability, is used for this purpose. It offers wide support for LLM servers, robustness, and usage information and API key storage through PostgreSQL . LiteLLM also enables synchronization between different instances and sends LLM usage information to our observability tools . Integration AI Services Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 03: Integration, Orchestration & Deployment Tooling PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 15
  • 16. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Observability & Traceability Observability and Traceability Implement observability tools to gain insights into the behavior and performance of your AI applications: Centralized Logging Aggregate logs from different services and applications in a central location for easier analysis and troubleshooting Metrics Collection Collect metrics on various aspects of your applications' performance, such as response times, error rates, and resource usage. Distributed Tracing Use distributed tracing to track requests as they flow through different services, helping to identify bottlenecks and optimize performance.. The LLMs observability layer collects usage data and execution traces, ensuring proper LLM management . IG1 efficiently manages LLM usage through a monitoring stack connected to the LLM orchestrator . Lago and OpenMeter collect information , which is then transmitted to our central observability system, Sismology . Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 03: Integration, Orchestration & Deployment Tooling PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 16
  • 17. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Chat Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 04: AI Applications PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 17
  • 18. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Dev Copilot Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 04: AI Applications PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 18
  • 19. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 1. Plug n Play Gen AI Platform Context: Artificial Intelligence Era PLUG N PLAY GEN AI PLATFORM Context 92% of US-based developers have already used an AI assistant for coding 2 92% 2 https://github.blog/2023-06-13-survey-reveals-ais-impact-on-the-developer-experience/ 19
  • 20. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Low Code LLM tool Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 04: AI Applications PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 20
  • 21. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 API Setup . . . Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure / Layer 04: AI Applications PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 21
  • 22. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Physical Servers Base System KUBE by IG1 for AI LLM Model Setup RAG Setup Integration AI Services Observability & Traceability Chat Dev Copilot Low Code LLM tool API Setup . . . Plug n Play Gen AI Platform Layered Architecture for Gen AI Infrastructure PLUG N PLAY GEN AI PLATFORM Layered Architecture Concept: Gen AI 22
  • 23. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 23 Artificial Intelligence Era PLUG N PLAY GEN AI PLATFORM Our Mission Our mission Help organizations to benefit from AI by providing them AI platforms Plug n Play Gen AI Platform
  • 24. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Service Benefit from our expert-led AI services that deliver tailored solutions, from infrastructure design to ongoing support, ensuring seamless integration and immediate usability . With advanced security, data integrity oversight, and personalized interfaces, our services enhance your operations with integrated AI tools for efficient DevOps, MLOps, and AIOps, supporting scalable and effective AI management. Hosting Elevate your AI applications with our cutting-edge hardware and cloud infrastructure, featuring NVIDIA GPU-equipped servers, optimized Linux-based IG1 AI OS, and KUBE by IG1 for efficient virtual machine and container management. Our comprehensive solutions cover everything from initial server setup to seamless deployment, ensuring exceptional performance and flexibility for your AI workloads. Software Enhance your AI projects with our all-inclusive software solutions designed for deploying and managing Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) systems . Our packages offer powerful tools for API integration, data pipeline management, containerized deployment, and comprehensive observability, ensuring smooth operations and insightful performance metrics for optimal resource management. Pricing Our Offers , starting July 2024 + + 24 PLUG N PLAY GEN AI PLATFORM Our Offers
  • 25. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Full RAG: Embedding, Reranking resources and Vector DB LLMs inference servers for multi LLMs Slack Support 24 / 7 Control your data, hosted in France KUBE by IG1 for AI Train, Fine-tune et run your own Models Metrology & Supervision by Sismology Ollama to OpenAI API translator H100 & H200 GPUs AIOps, MLOps Consulting Support 25 Plug n Play Platforms for Gen AI Models quantization for LLMs optimization OpenAI API-compatible core
  • 26. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 2. Customer Testimonials 26
  • 27. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Customer Testimonials: Easybourse 27 Data Security and GDPR Compliance CUSTOMER TESTIMONIAL Context Highly regulated banking sector Absolute data security
  • 28. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Customer Testimonials: Easybourse 28 Expertise and innovation at Iguana Solutions CUSTOMER TESTIMONIAL Research & Development ● Recognized expertise on highly innovative topics ● Benefit from one year of R&D ● Numerous tests to evaluate what works
  • 29. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Customer Testimonials: Easybourse 29 Need for internal resources CUSTOMER TESTIMONIAL HR ● No internal resources dedicated to R&D on GenAI ● Difficulty progressing alone on the subject
  • 30. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Customer Testimonials: Easybourse 30 Deployed Gen AI Platform CUSTOMER TESTIMONIAL Easybourse Gen AI Platform 1. Hardware & Cloud Infrastructures 2. LLM & RAG Deployments 3. Integration, Orchestration, and Deployment Tools 4. AI Applications
  • 31. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Customer Testimonials: Easybourse 31 Utilization of Code Copilot CUSTOMER TESTIMONIAL Use Cases ● Immediate adoption of Copilot for coding assistance ● Ease of deployment: Plug n Play in less than an hour ● Proven efficiency from the first uses ● Unfailing and high-quality support and guidance
  • 32. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Customer Testimonials: Easybourse 32 Deployment and Next Steps CUSTOMER TESTIMONIAL Use Cases ● Initial deployment of GenAI for operations ● Current testing of the IG1 platform with RAG
  • 33. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Customer Testimonials: Easybourse 33 Key Benefits for Easybourse CUSTOMER TESTIMONIAL Key Benefits ● Guaranteed data security, no DPO discussions ● Access to results from over a year of R&D ● True Plug n Play: operational in no time ● Very high-quality professional service
  • 34. Iguane Solutions ©2024 - all rights reserved - ig1.com IG1 Questions 34
  • 36. Iguane Solutions ©2024 - all rights reserved - ig1.com 36