Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
SlideShare a Scribd company logo
ADVANCING DRUG SEARCH
WITH POSTGRESQL AND
AZURE AI
Taras Kloba
Senior Solutions Architect, Data & AI at
Microsoft
POSETTE - 2024
PANACEA
Taras Kloba
Senior Solutions Architect, Data & AI at Microsoft
• Founded a volunteering IT group for Ukrainian army
systems, winner of the TIDE NATO Hackathon and
Ukraine Defence Hackathon.
• Co-leader of PostgreSQL Ukraine and Big Data
Community
• More than 14 years of experience in Data
Engineering
• Ph.D. in Economics
• Winner of the Ukrainian IT Awards 2019 in Software
Architecture
• Certified Cloud Architect & Data Engineer on Google
Cloud, Microsoft, and Amazon Web Services.
Microsoft Certified Trainer
• Father of three daughters
PANACEA
the challenge.
understanding our challenge
3
PANACEA
Medication Match-Up.
When a new war starts, NATO troops
from different countries unite, each
with their medical supplies. Our
challenge was to make an app that
helps find the same medicine
across different country brands.
4
The Suwalki Gap: A Hypothetical Flashpoint
PANACEA
PANACEA
the solution.
innovation born of necessity
5
PANACEA
Inna Olkhova.
The diagram shows our app's ability to
match different medicine brands
through their active ingredient,
Acetaminophen. Inna’s idea helps find
the same medicine, whether it's Tylenol
in the US, Panadol in the UK,
Paracetamol in the EU, or Dafalgan in
France.
known as the "Bee Paramedic"
6
PANACEA
Tata Kepler.
From Artistry to Aid: Shaping Accessible Care in the
Field
7
Tata showed us how medicine sorting works,
especially when volunteers might not know medical
terms. She introduced us to using the ATC system in
our app, making it easier for everyone to find and
organize medicines correctly.
PANACEA
Daryna Smolnikova.
Daryna shared the idea of using NFC,
QR codes, and short links to store and
share information about the wounded.
She emphasized that sometimes,
sharing this information, even with the
enemy, is crucial to saving lives.
Immediate Action: Key to Battlefield Survival
8
PANACEA
Iryna Soloshenko.
Train Evacuations in Ukraine: Pioneering Lifesaving
Journeys
9
PANACEA
tech
specs.
engineering precision for lifesaving solutions
10
PANACEA 11
Architecture.
Azure Database
for PostgreSQL
with the Azure_AI
extension
PANACEA 12
Architecture.
Azure Database
for PostgreSQL
with the Azure_AI
extension
Can be replaced
with
azure_local_ai
extension
Can be replaced
with
Data API Builder
PANACEA 13
ILIKE.
The ILIKE operator in PostgreSQL is used for
case-insensitive pattern matching. It
functions similarly to the LIKE operator but
ignores the case of the character
SELECT *
FROM drugs
WHERE drug_name ILIKE
'tylen%';
SELECT *
FROM drugs
WHERE drug_name ILIKE
'%tylen%';
PANACEA 14
TRIGRAMS.
A trigram is a group of three consecutive
characters in a string. By comparing these
trigrams, PostgreSQL can measure the
similarity between strings. Useful for finding
similar strings despite minor spelling errors
and variations.
CREATE EXTENSION pg_trgm;
SELECT drug_name
FROM drugs
WHERE drug_name % 'tylelon'
ORDER BY similarity(drug_name, 'tylelon')
DESC;
PANACEA 15
VECTOR SEARCH.
Vector search uses vector representations of text to
find similar items. By comparing these vectors,
PostgreSQL can measure the similarity between text
descriptions.
CREATE EXTENSION azure_ai;
ALTER TABLE drugs
ADD COLUMN vector VECTOR GENERATED ALWAYS AS
(azure_ai.vectorize(drug_name)) STORED;
SELECT drug_name
FROM drugs
ORDER BY vector <=> azure_ai.vectorize('pill for
pain management')
LIMIT 5;
PANACEA 16
VECTOR SEARCH.
Vector search uses vector representations of text to
find similar items. By comparing these vectors,
PostgreSQL can measure the similarity between text
descriptions.
CREATE EXTENSION azure_ai;
ALTER TABLE drugs
ADD COLUMN vector VECTOR GENERATED ALWAYS AS
(azure_ai.vectorize(drug_name)) STORED;
SELECT drug_name
FROM drugs
ORDER BY vector <=> azure_ai.vectorize('red pill for
headaches')
LIMIT 5;
PANACEA 17
DRUGGPT (AI RAG ASSSISTANT).
RAG (Retrieval-Augmented Generation) is a hybrid
approach that combines retrieval and generation
techniques to enhance the quality of AI responses. It
first retrieves relevant information from a knowledge
base or database and then uses this information to
generate more accurate and contextually relevant
answers.
PANACEA 18
Search.
Layer 1 Layer 2 Layer 3 Layer 4 Layer 5 Layer 6 Layer 7
Machine learning models
for semantic
representation, ranking
for the rest of results.
Exact and pattern text
search (ilike%), trigram
similarity calculation,
and ranking for best
results.
Similarity search with
threshold 0.7, ranking for
best results.
Machine learning models
for semantic
representation, with a
threshold of 0.9, ranking
for best results.
Pattern text search
(%ilike%), trigram
similarity calculation,
ranking for best results.
Full-text search in
summary, description, and
indication columns.
Similarity search with
threshold 0.5, ranking for
best results.
Yes
No
Is result
found?
Yes
No
Is result
found?
Yes
No
Is result
found?
Yes
No
Is result
found?
Yes
No
Is result
found?
Yes
Is result
found?
No
PANACEA 19
OCR Search.
[Tylenol], [Acetaminophen], [Pain], [Reliever],
[Fever], [Reducer], [Extra], [Strength], [FOR],
[ADULTS], [500], [mg], [each]
Database
PANACEA 20
Knowledge Graph.
Powered by DrugBank.com database
PANACEA
demo.
21
PANACEA
try it out.
22
PANACEA
get in touch.
Taras Kloba
aka.ms/taras
23

More Related Content

Similar to "NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba

FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
Michel Dumontier
 
IBM_Analytics_eBook_07 15 16
IBM_Analytics_eBook_07 15 16IBM_Analytics_eBook_07 15 16
IBM_Analytics_eBook_07 15 16
Volkan Tekeli
 
Multidimensioal database
Multidimensioal  databaseMultidimensioal  database
Multidimensioal database
TPO TPO
 
Multidimensioal database
Multidimensioal  databaseMultidimensioal  database
Multidimensioal database
Pankaj Thakur
 
AI/ML Webinar - Improve Public Health
AI/ML Webinar - Improve Public HealthAI/ML Webinar - Improve Public Health
AI/ML Webinar - Improve Public Health
Amazon Web Services
 
API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
 API-Centric Data Integration for Human Genomics Reference Databases: Achieve... API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
Genomika Diagnósticos
 
Btp.pptx
Btp.pptxBtp.pptx
How SAP HANA can provide value for Pharma R&D
How SAP HANA can provide value for Pharma R&DHow SAP HANA can provide value for Pharma R&D
How SAP HANA can provide value for Pharma R&D
Marc Maurer
 
IOT.ppt
IOT.pptIOT.ppt
IOT.ppt
Mvidhya9
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j
 
Nikhil CV
Nikhil CVNikhil CV
Nikhil CV
Nikhil Sharma
 
Regression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms ExcelRegression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms Excel
Dr. Abdul Ahad Abro
 
A Survey on Bioinformatics Tools
A Survey on Bioinformatics ToolsA Survey on Bioinformatics Tools
A Survey on Bioinformatics Tools
idescitation
 
Comparison of machine learning methods for breast cancer diagnosis
Comparison of machine learning methods for breast cancer diagnosisComparison of machine learning methods for breast cancer diagnosis
Comparison of machine learning methods for breast cancer diagnosis
Venkat Projects
 
Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data LakeFishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data Lake
ArangoDB Database
 
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
Big Data Spain
 
50120140504015
5012014050401550120140504015
50120140504015
IAEME Publication
 
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERYA WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
IJDKP
 
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERYA WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
IJDKP
 
Comparing EDA with classical and Bayesian analysis.pptx
Comparing EDA with classical and Bayesian analysis.pptxComparing EDA with classical and Bayesian analysis.pptx
Comparing EDA with classical and Bayesian analysis.pptx
PremaGanesh1
 

Similar to "NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba (20)

FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
 
IBM_Analytics_eBook_07 15 16
IBM_Analytics_eBook_07 15 16IBM_Analytics_eBook_07 15 16
IBM_Analytics_eBook_07 15 16
 
Multidimensioal database
Multidimensioal  databaseMultidimensioal  database
Multidimensioal database
 
Multidimensioal database
Multidimensioal  databaseMultidimensioal  database
Multidimensioal database
 
AI/ML Webinar - Improve Public Health
AI/ML Webinar - Improve Public HealthAI/ML Webinar - Improve Public Health
AI/ML Webinar - Improve Public Health
 
API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
 API-Centric Data Integration for Human Genomics Reference Databases: Achieve... API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
API-Centric Data Integration for Human Genomics Reference Databases: Achieve...
 
Btp.pptx
Btp.pptxBtp.pptx
Btp.pptx
 
How SAP HANA can provide value for Pharma R&D
How SAP HANA can provide value for Pharma R&DHow SAP HANA can provide value for Pharma R&D
How SAP HANA can provide value for Pharma R&D
 
IOT.ppt
IOT.pptIOT.ppt
IOT.ppt
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Nikhil CV
Nikhil CVNikhil CV
Nikhil CV
 
Regression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms ExcelRegression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms Excel
 
A Survey on Bioinformatics Tools
A Survey on Bioinformatics ToolsA Survey on Bioinformatics Tools
A Survey on Bioinformatics Tools
 
Comparison of machine learning methods for breast cancer diagnosis
Comparison of machine learning methods for breast cancer diagnosisComparison of machine learning methods for breast cancer diagnosis
Comparison of machine learning methods for breast cancer diagnosis
 
Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data LakeFishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data Lake
 
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
 
50120140504015
5012014050401550120140504015
50120140504015
 
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERYA WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
 
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERYA WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
A WEB REPOSITORY SYSTEM FOR DATA MINING IN DRUG DISCOVERY
 
Comparing EDA with classical and Bayesian analysis.pptx
Comparing EDA with classical and Bayesian analysis.pptxComparing EDA with classical and Bayesian analysis.pptx
Comparing EDA with classical and Bayesian analysis.pptx
 

More from Fwdays

"What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w..."What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w...
Fwdays
 
"Microservices and multitenancy - how to serve thousands of databases in one ...
"Microservices and multitenancy - how to serve thousands of databases in one ..."Microservices and multitenancy - how to serve thousands of databases in one ...
"Microservices and multitenancy - how to serve thousands of databases in one ...
Fwdays
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
Fwdays
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
"Black Monday: The Story of 5.5 Hours of Downtime", Dmytro Dziubenko
"Black Monday: The Story of 5.5 Hours of Downtime", Dmytro Dziubenko"Black Monday: The Story of 5.5 Hours of Downtime", Dmytro Dziubenko
"Black Monday: The Story of 5.5 Hours of Downtime", Dmytro Dziubenko
Fwdays
 
"Reaching 3_000_000 HTTP requests per second — conclusions from participation...
"Reaching 3_000_000 HTTP requests per second — conclusions from participation..."Reaching 3_000_000 HTTP requests per second — conclusions from participation...
"Reaching 3_000_000 HTTP requests per second — conclusions from participation...
Fwdays
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
Fwdays
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
Fwdays
 
"What I learned through reverse engineering", Yuri Artiukh
"What I learned through reverse engineering", Yuri Artiukh"What I learned through reverse engineering", Yuri Artiukh
"What I learned through reverse engineering", Yuri Artiukh
Fwdays
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
Fwdays
 
"Micro frontends: Unbelievably true life story", Dmytro Pavlov
"Micro frontends: Unbelievably true life story", Dmytro Pavlov"Micro frontends: Unbelievably true life story", Dmytro Pavlov
"Micro frontends: Unbelievably true life story", Dmytro Pavlov
Fwdays
 
"Objects validation and comparison using runtime types (io-ts)", Oleksandr Suhak
"Objects validation and comparison using runtime types (io-ts)", Oleksandr Suhak"Objects validation and comparison using runtime types (io-ts)", Oleksandr Suhak
"Objects validation and comparison using runtime types (io-ts)", Oleksandr Suhak
Fwdays
 
"JavaScript. Standard evolution, when nobody cares", Roman Savitskyi
"JavaScript. Standard evolution, when nobody cares", Roman Savitskyi"JavaScript. Standard evolution, when nobody cares", Roman Savitskyi
"JavaScript. Standard evolution, when nobody cares", Roman Savitskyi
Fwdays
 
"How Preply reduced ML model development time from 1 month to 1 day",Yevhen Y...
"How Preply reduced ML model development time from 1 month to 1 day",Yevhen Y..."How Preply reduced ML model development time from 1 month to 1 day",Yevhen Y...
"How Preply reduced ML model development time from 1 month to 1 day",Yevhen Y...
Fwdays
 
"GenAI Apps: Our Journey from Ideas to Production Excellence",Danil Topchii
"GenAI Apps: Our Journey from Ideas to Production Excellence",Danil Topchii"GenAI Apps: Our Journey from Ideas to Production Excellence",Danil Topchii
"GenAI Apps: Our Journey from Ideas to Production Excellence",Danil Topchii
Fwdays
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
Fwdays
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
Fwdays
 
"What is a RAG system and how to build it",Dmytro Spodarets
"What is a RAG system and how to build it",Dmytro Spodarets"What is a RAG system and how to build it",Dmytro Spodarets
"What is a RAG system and how to build it",Dmytro Spodarets
Fwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
Fwdays
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
Fwdays
 

More from Fwdays (20)

"What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w..."What does it really mean for your system to be available, or how to define w...
"What does it really mean for your system to be available, or how to define w...
 
"Microservices and multitenancy - how to serve thousands of databases in one ...
"Microservices and multitenancy - how to serve thousands of databases in one ..."Microservices and multitenancy - how to serve thousands of databases in one ...
"Microservices and multitenancy - how to serve thousands of databases in one ...
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
"Black Monday: The Story of 5.5 Hours of Downtime", Dmytro Dziubenko
"Black Monday: The Story of 5.5 Hours of Downtime", Dmytro Dziubenko"Black Monday: The Story of 5.5 Hours of Downtime", Dmytro Dziubenko
"Black Monday: The Story of 5.5 Hours of Downtime", Dmytro Dziubenko
 
"Reaching 3_000_000 HTTP requests per second — conclusions from participation...
"Reaching 3_000_000 HTTP requests per second — conclusions from participation..."Reaching 3_000_000 HTTP requests per second — conclusions from participation...
"Reaching 3_000_000 HTTP requests per second — conclusions from participation...
 
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin..."$10 thousand per minute of downtime: architecture, queues, streaming and fin...
"$10 thousand per minute of downtime: architecture, queues, streaming and fin...
 
"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota"Choosing proper type of scaling", Olena Syrota
"Choosing proper type of scaling", Olena Syrota
 
"What I learned through reverse engineering", Yuri Artiukh
"What I learned through reverse engineering", Yuri Artiukh"What I learned through reverse engineering", Yuri Artiukh
"What I learned through reverse engineering", Yuri Artiukh
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
"Micro frontends: Unbelievably true life story", Dmytro Pavlov
"Micro frontends: Unbelievably true life story", Dmytro Pavlov"Micro frontends: Unbelievably true life story", Dmytro Pavlov
"Micro frontends: Unbelievably true life story", Dmytro Pavlov
 
"Objects validation and comparison using runtime types (io-ts)", Oleksandr Suhak
"Objects validation and comparison using runtime types (io-ts)", Oleksandr Suhak"Objects validation and comparison using runtime types (io-ts)", Oleksandr Suhak
"Objects validation and comparison using runtime types (io-ts)", Oleksandr Suhak
 
"JavaScript. Standard evolution, when nobody cares", Roman Savitskyi
"JavaScript. Standard evolution, when nobody cares", Roman Savitskyi"JavaScript. Standard evolution, when nobody cares", Roman Savitskyi
"JavaScript. Standard evolution, when nobody cares", Roman Savitskyi
 
"How Preply reduced ML model development time from 1 month to 1 day",Yevhen Y...
"How Preply reduced ML model development time from 1 month to 1 day",Yevhen Y..."How Preply reduced ML model development time from 1 month to 1 day",Yevhen Y...
"How Preply reduced ML model development time from 1 month to 1 day",Yevhen Y...
 
"GenAI Apps: Our Journey from Ideas to Production Excellence",Danil Topchii
"GenAI Apps: Our Journey from Ideas to Production Excellence",Danil Topchii"GenAI Apps: Our Journey from Ideas to Production Excellence",Danil Topchii
"GenAI Apps: Our Journey from Ideas to Production Excellence",Danil Topchii
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
"What is a RAG system and how to build it",Dmytro Spodarets
"What is a RAG system and how to build it",Dmytro Spodarets"What is a RAG system and how to build it",Dmytro Spodarets
"What is a RAG system and how to build it",Dmytro Spodarets
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

Recently uploaded

Observability For You and Me with OpenTelemetry
Observability For You and Me with OpenTelemetryObservability For You and Me with OpenTelemetry
Observability For You and Me with OpenTelemetry
Eric D. Schabell
 
HTTP Adaptive Streaming – Quo Vadis (2024)
HTTP Adaptive Streaming – Quo Vadis (2024)HTTP Adaptive Streaming – Quo Vadis (2024)
HTTP Adaptive Streaming – Quo Vadis (2024)
Alpen-Adria-Universität
 
Transcript: Details of description part II: Describing images in practice - T...
Transcript: Details of description part II: Describing images in practice - T...Transcript: Details of description part II: Describing images in practice - T...
Transcript: Details of description part II: Describing images in practice - T...
BookNet Canada
 
Paradigm Shifts in User Modeling: A Journey from Historical Foundations to Em...
Paradigm Shifts in User Modeling: A Journey from Historical Foundations to Em...Paradigm Shifts in User Modeling: A Journey from Historical Foundations to Em...
Paradigm Shifts in User Modeling: A Journey from Historical Foundations to Em...
Erasmo Purificato
 
Details of description part II: Describing images in practice - Tech Forum 2024
Details of description part II: Describing images in practice - Tech Forum 2024Details of description part II: Describing images in practice - Tech Forum 2024
Details of description part II: Describing images in practice - Tech Forum 2024
BookNet Canada
 
Verti - EMEA Insurer Innovation Award 2024
Verti - EMEA Insurer Innovation Award 2024Verti - EMEA Insurer Innovation Award 2024
Verti - EMEA Insurer Innovation Award 2024
The Digital Insurer
 
How Netflix Builds High Performance Applications at Global Scale
How Netflix Builds High Performance Applications at Global ScaleHow Netflix Builds High Performance Applications at Global Scale
How Netflix Builds High Performance Applications at Global Scale
ScyllaDB
 
5G bootcamp Sep 2020 (NPI initiative).pptx
5G bootcamp Sep 2020 (NPI initiative).pptx5G bootcamp Sep 2020 (NPI initiative).pptx
5G bootcamp Sep 2020 (NPI initiative).pptx
SATYENDRA100
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
Emerging Tech
 
AI_dev Europe 2024 - From OpenAI to Opensource AI
AI_dev Europe 2024 - From OpenAI to Opensource AIAI_dev Europe 2024 - From OpenAI to Opensource AI
AI_dev Europe 2024 - From OpenAI to Opensource AI
Raphaël Semeteys
 
GDG Cloud Southlake #34: Neatsun Ziv: Automating Appsec
GDG Cloud Southlake #34: Neatsun Ziv: Automating AppsecGDG Cloud Southlake #34: Neatsun Ziv: Automating Appsec
GDG Cloud Southlake #34: Neatsun Ziv: Automating Appsec
James Anderson
 
Research Directions for Cross Reality Interfaces
Research Directions for Cross Reality InterfacesResearch Directions for Cross Reality Interfaces
Research Directions for Cross Reality Interfaces
Mark Billinghurst
 
Pigging Solutions Sustainability brochure.pdf
Pigging Solutions Sustainability brochure.pdfPigging Solutions Sustainability brochure.pdf
Pigging Solutions Sustainability brochure.pdf
Pigging Solutions
 
UiPath Community Day Kraków: Devs4Devs Conference
UiPath Community Day Kraków: Devs4Devs ConferenceUiPath Community Day Kraków: Devs4Devs Conference
UiPath Community Day Kraków: Devs4Devs Conference
UiPathCommunity
 
Running a Go App in Kubernetes: CPU Impacts
Running a Go App in Kubernetes: CPU ImpactsRunning a Go App in Kubernetes: CPU Impacts
Running a Go App in Kubernetes: CPU Impacts
ScyllaDB
 
The Increasing Use of the National Research Platform by the CSU Campuses
The Increasing Use of the National Research Platform by the CSU CampusesThe Increasing Use of the National Research Platform by the CSU Campuses
The Increasing Use of the National Research Platform by the CSU Campuses
Larry Smarr
 
MYIR Product Brochure - A Global Provider of Embedded SOMs & Solutions
MYIR Product Brochure - A Global Provider of Embedded SOMs & SolutionsMYIR Product Brochure - A Global Provider of Embedded SOMs & Solutions
MYIR Product Brochure - A Global Provider of Embedded SOMs & Solutions
Linda Zhang
 
How RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptxHow RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptx
SynapseIndia
 
Knowledge and Prompt Engineering Part 2 Focus on Prompt Design Approaches
Knowledge and Prompt Engineering Part 2 Focus on Prompt Design ApproachesKnowledge and Prompt Engineering Part 2 Focus on Prompt Design Approaches
Knowledge and Prompt Engineering Part 2 Focus on Prompt Design Approaches
Earley Information Science
 
What’s New in Teams Calling, Meetings and Devices May 2024
What’s New in Teams Calling, Meetings and Devices May 2024What’s New in Teams Calling, Meetings and Devices May 2024
What’s New in Teams Calling, Meetings and Devices May 2024
Stephanie Beckett
 

Recently uploaded (20)

Observability For You and Me with OpenTelemetry
Observability For You and Me with OpenTelemetryObservability For You and Me with OpenTelemetry
Observability For You and Me with OpenTelemetry
 
HTTP Adaptive Streaming – Quo Vadis (2024)
HTTP Adaptive Streaming – Quo Vadis (2024)HTTP Adaptive Streaming – Quo Vadis (2024)
HTTP Adaptive Streaming – Quo Vadis (2024)
 
Transcript: Details of description part II: Describing images in practice - T...
Transcript: Details of description part II: Describing images in practice - T...Transcript: Details of description part II: Describing images in practice - T...
Transcript: Details of description part II: Describing images in practice - T...
 
Paradigm Shifts in User Modeling: A Journey from Historical Foundations to Em...
Paradigm Shifts in User Modeling: A Journey from Historical Foundations to Em...Paradigm Shifts in User Modeling: A Journey from Historical Foundations to Em...
Paradigm Shifts in User Modeling: A Journey from Historical Foundations to Em...
 
Details of description part II: Describing images in practice - Tech Forum 2024
Details of description part II: Describing images in practice - Tech Forum 2024Details of description part II: Describing images in practice - Tech Forum 2024
Details of description part II: Describing images in practice - Tech Forum 2024
 
Verti - EMEA Insurer Innovation Award 2024
Verti - EMEA Insurer Innovation Award 2024Verti - EMEA Insurer Innovation Award 2024
Verti - EMEA Insurer Innovation Award 2024
 
How Netflix Builds High Performance Applications at Global Scale
How Netflix Builds High Performance Applications at Global ScaleHow Netflix Builds High Performance Applications at Global Scale
How Netflix Builds High Performance Applications at Global Scale
 
5G bootcamp Sep 2020 (NPI initiative).pptx
5G bootcamp Sep 2020 (NPI initiative).pptx5G bootcamp Sep 2020 (NPI initiative).pptx
5G bootcamp Sep 2020 (NPI initiative).pptx
 
Implementations of Fused Deposition Modeling in real world
Implementations of Fused Deposition Modeling  in real worldImplementations of Fused Deposition Modeling  in real world
Implementations of Fused Deposition Modeling in real world
 
AI_dev Europe 2024 - From OpenAI to Opensource AI
AI_dev Europe 2024 - From OpenAI to Opensource AIAI_dev Europe 2024 - From OpenAI to Opensource AI
AI_dev Europe 2024 - From OpenAI to Opensource AI
 
GDG Cloud Southlake #34: Neatsun Ziv: Automating Appsec
GDG Cloud Southlake #34: Neatsun Ziv: Automating AppsecGDG Cloud Southlake #34: Neatsun Ziv: Automating Appsec
GDG Cloud Southlake #34: Neatsun Ziv: Automating Appsec
 
Research Directions for Cross Reality Interfaces
Research Directions for Cross Reality InterfacesResearch Directions for Cross Reality Interfaces
Research Directions for Cross Reality Interfaces
 
Pigging Solutions Sustainability brochure.pdf
Pigging Solutions Sustainability brochure.pdfPigging Solutions Sustainability brochure.pdf
Pigging Solutions Sustainability brochure.pdf
 
UiPath Community Day Kraków: Devs4Devs Conference
UiPath Community Day Kraków: Devs4Devs ConferenceUiPath Community Day Kraków: Devs4Devs Conference
UiPath Community Day Kraków: Devs4Devs Conference
 
Running a Go App in Kubernetes: CPU Impacts
Running a Go App in Kubernetes: CPU ImpactsRunning a Go App in Kubernetes: CPU Impacts
Running a Go App in Kubernetes: CPU Impacts
 
The Increasing Use of the National Research Platform by the CSU Campuses
The Increasing Use of the National Research Platform by the CSU CampusesThe Increasing Use of the National Research Platform by the CSU Campuses
The Increasing Use of the National Research Platform by the CSU Campuses
 
MYIR Product Brochure - A Global Provider of Embedded SOMs & Solutions
MYIR Product Brochure - A Global Provider of Embedded SOMs & SolutionsMYIR Product Brochure - A Global Provider of Embedded SOMs & Solutions
MYIR Product Brochure - A Global Provider of Embedded SOMs & Solutions
 
How RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptxHow RPA Help in the Transportation and Logistics Industry.pptx
How RPA Help in the Transportation and Logistics Industry.pptx
 
Knowledge and Prompt Engineering Part 2 Focus on Prompt Design Approaches
Knowledge and Prompt Engineering Part 2 Focus on Prompt Design ApproachesKnowledge and Prompt Engineering Part 2 Focus on Prompt Design Approaches
Knowledge and Prompt Engineering Part 2 Focus on Prompt Design Approaches
 
What’s New in Teams Calling, Meetings and Devices May 2024
What’s New in Teams Calling, Meetings and Devices May 2024What’s New in Teams Calling, Meetings and Devices May 2024
What’s New in Teams Calling, Meetings and Devices May 2024
 

"NATO Hackathon Winner: AI-Powered Drug Search", Taras Kloba

  • 1. ADVANCING DRUG SEARCH WITH POSTGRESQL AND AZURE AI Taras Kloba Senior Solutions Architect, Data & AI at Microsoft POSETTE - 2024
  • 2. PANACEA Taras Kloba Senior Solutions Architect, Data & AI at Microsoft • Founded a volunteering IT group for Ukrainian army systems, winner of the TIDE NATO Hackathon and Ukraine Defence Hackathon. • Co-leader of PostgreSQL Ukraine and Big Data Community • More than 14 years of experience in Data Engineering • Ph.D. in Economics • Winner of the Ukrainian IT Awards 2019 in Software Architecture • Certified Cloud Architect & Data Engineer on Google Cloud, Microsoft, and Amazon Web Services. Microsoft Certified Trainer • Father of three daughters
  • 4. PANACEA Medication Match-Up. When a new war starts, NATO troops from different countries unite, each with their medical supplies. Our challenge was to make an app that helps find the same medicine across different country brands. 4 The Suwalki Gap: A Hypothetical Flashpoint PANACEA
  • 6. PANACEA Inna Olkhova. The diagram shows our app's ability to match different medicine brands through their active ingredient, Acetaminophen. Inna’s idea helps find the same medicine, whether it's Tylenol in the US, Panadol in the UK, Paracetamol in the EU, or Dafalgan in France. known as the "Bee Paramedic" 6
  • 7. PANACEA Tata Kepler. From Artistry to Aid: Shaping Accessible Care in the Field 7 Tata showed us how medicine sorting works, especially when volunteers might not know medical terms. She introduced us to using the ATC system in our app, making it easier for everyone to find and organize medicines correctly.
  • 8. PANACEA Daryna Smolnikova. Daryna shared the idea of using NFC, QR codes, and short links to store and share information about the wounded. She emphasized that sometimes, sharing this information, even with the enemy, is crucial to saving lives. Immediate Action: Key to Battlefield Survival 8
  • 9. PANACEA Iryna Soloshenko. Train Evacuations in Ukraine: Pioneering Lifesaving Journeys 9
  • 11. PANACEA 11 Architecture. Azure Database for PostgreSQL with the Azure_AI extension
  • 12. PANACEA 12 Architecture. Azure Database for PostgreSQL with the Azure_AI extension Can be replaced with azure_local_ai extension Can be replaced with Data API Builder
  • 13. PANACEA 13 ILIKE. The ILIKE operator in PostgreSQL is used for case-insensitive pattern matching. It functions similarly to the LIKE operator but ignores the case of the character SELECT * FROM drugs WHERE drug_name ILIKE 'tylen%'; SELECT * FROM drugs WHERE drug_name ILIKE '%tylen%';
  • 14. PANACEA 14 TRIGRAMS. A trigram is a group of three consecutive characters in a string. By comparing these trigrams, PostgreSQL can measure the similarity between strings. Useful for finding similar strings despite minor spelling errors and variations. CREATE EXTENSION pg_trgm; SELECT drug_name FROM drugs WHERE drug_name % 'tylelon' ORDER BY similarity(drug_name, 'tylelon') DESC;
  • 15. PANACEA 15 VECTOR SEARCH. Vector search uses vector representations of text to find similar items. By comparing these vectors, PostgreSQL can measure the similarity between text descriptions. CREATE EXTENSION azure_ai; ALTER TABLE drugs ADD COLUMN vector VECTOR GENERATED ALWAYS AS (azure_ai.vectorize(drug_name)) STORED; SELECT drug_name FROM drugs ORDER BY vector <=> azure_ai.vectorize('pill for pain management') LIMIT 5;
  • 16. PANACEA 16 VECTOR SEARCH. Vector search uses vector representations of text to find similar items. By comparing these vectors, PostgreSQL can measure the similarity between text descriptions. CREATE EXTENSION azure_ai; ALTER TABLE drugs ADD COLUMN vector VECTOR GENERATED ALWAYS AS (azure_ai.vectorize(drug_name)) STORED; SELECT drug_name FROM drugs ORDER BY vector <=> azure_ai.vectorize('red pill for headaches') LIMIT 5;
  • 17. PANACEA 17 DRUGGPT (AI RAG ASSSISTANT). RAG (Retrieval-Augmented Generation) is a hybrid approach that combines retrieval and generation techniques to enhance the quality of AI responses. It first retrieves relevant information from a knowledge base or database and then uses this information to generate more accurate and contextually relevant answers.
  • 18. PANACEA 18 Search. Layer 1 Layer 2 Layer 3 Layer 4 Layer 5 Layer 6 Layer 7 Machine learning models for semantic representation, ranking for the rest of results. Exact and pattern text search (ilike%), trigram similarity calculation, and ranking for best results. Similarity search with threshold 0.7, ranking for best results. Machine learning models for semantic representation, with a threshold of 0.9, ranking for best results. Pattern text search (%ilike%), trigram similarity calculation, ranking for best results. Full-text search in summary, description, and indication columns. Similarity search with threshold 0.5, ranking for best results. Yes No Is result found? Yes No Is result found? Yes No Is result found? Yes No Is result found? Yes No Is result found? Yes Is result found? No
  • 19. PANACEA 19 OCR Search. [Tylenol], [Acetaminophen], [Pain], [Reliever], [Fever], [Reducer], [Extra], [Strength], [FOR], [ADULTS], [500], [mg], [each] Database
  • 20. PANACEA 20 Knowledge Graph. Powered by DrugBank.com database
  • 23. PANACEA get in touch. Taras Kloba aka.ms/taras 23