Explore Azure AI Services: Curated list of prebuilt models and demos

  • Thread starter: RasikaSavant
Explore Azure AI Services: Prebuilt Models and Demos

Azure AI services provide a suite of prebuilt models and demos covering a wide range of use cases. The models are readily accessible, so you can implement AI-powered solutions quickly. We have curated and catalogued the prebuilt demos available across Azure AI services, and we hope this list helps you bring AI into your products and services.



Speech Recognition

Speech to Text Scenarios

Scenario | Description | Link
Real-time speech to text | Quickly test your audio on a speech recognition endpoint without writing any code. | Explore Demo
Whisper Model in Azure OpenAI Service | Transcribe and translate audio content from 57 languages into English using OpenAI Whisper v2-large model. | Explore Demo
Batch speech to text | Transcribe large amounts of audio in storage asynchronously. | Explore Demo
Custom Speech | Improve speech recognition accuracy with domain-specific vocabulary and data. | Explore Demo
Pronunciation Assessment | Evaluate and get feedback on speech pronunciation accuracy and fluency. | Explore Demo
Speech Translation | Translate speech into other languages in real-time with low latency. | Explore Demo
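
For the batch speech to text scenario listed above, a job is typically created by posting a small JSON document that points at audio already in storage. The sketch below only builds that request and does not send it. The endpoint path, the `v3.1` version, and the field names are assumptions based on the Speech to text REST API and should be checked against the current reference; the region, key, job name, and audio URL are placeholders.

```python
import json
import urllib.request


def build_batch_transcription_request(region, key, audio_urls, locale="en-US"):
    """Build (but do not send) a request that creates a batch transcription job."""
    # Assumed endpoint shape for the Speech to text REST API v3.1.
    url = f"https://{region}.api.cognitive.microsoft.com/speechtotext/v3.1/transcriptions"
    body = {
        "displayName": "Sample batch transcription",  # hypothetical job name
        "locale": locale,
        "contentUrls": list(audio_urls),  # audio files that already sit in storage
    }
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_batch_transcription_request(
    region="westus",  # placeholder region
    key="<your-speech-key>",  # placeholder key
    audio_urls=["https://example.com/audio/call-001.wav"],  # placeholder audio URL
)
print(request.get_method(), request.full_url)
```

Because transcription runs asynchronously, a real client would send this request and then poll the returned job until it completes.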



Text to Speech

Text to Speech Scenarios

Scenario | Description | Link
Voice Gallery | Choose from 486 voices across 148 languages and variants to create natural-sounding speech. | Explore Demo
Custom Neural Voice | Create a natural-sounding synthetic voice based on human voice recordings. | Explore Demo
Personal Voice | Create an AI voice from a human voice sample for personalized voice experiences. | Explore Demo
Audio Content Creation | Build highly natural audio content for various scenarios like audiobooks and video narrations. | Explore Demo
Text to speech Avatar | Turn your text into a video with an AI-generated avatar and realistic voice. | Explore Demo
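
Once you have picked a voice in the Voice Gallery, it is usually referenced by name in an SSML document sent to the text to speech service. Here is a minimal sketch of building such a document. The voice name `en-US-JennyNeural` is only an example, and `build_ssml` is a hypothetical helper, not part of any SDK.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr

SSML_NS = "http://www.w3.org/2001/10/synthesis"  # standard SSML namespace


def build_ssml(text, voice_name, lang="en-US"):
    """Return an SSML string asking for `text` to be spoken by `voice_name`."""
    return (
        f'<speak version="1.0" xmlns="{SSML_NS}" xml:lang={quoteattr(lang)}>'
        f"<voice name={quoteattr(voice_name)}>{escape(text)}</voice>"
        "</speak>"
    )


ssml = build_ssml("Hello from the voice gallery & friends!", "en-US-JennyNeural")

# Parse the result back to confirm it is well-formed XML.
root = ET.fromstring(ssml)
voice = root.find(f"{{{SSML_NS}}}voice")
print(voice.get("name"), "->", voice.text)
```

Escaping the text and quoting the attributes keeps characters such as `&` from breaking the markup.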



Other Scenarios

Scenario | Description | Link
Captioning with Speech to Text | Use our sample application to learn how to use Azure Speech to automatically caption your content in real time and offline by transcribing the audio of films, videos, live events, and more. Display the resulting text on a screen to provide an accessible experience. This example uses features such as speech to text and phrase lists. | Explore Demo
Post Call Transcription & Analytics | Batch transcribe call center recordings and extract valuable information such as Personally Identifiable Information (PII), sentiment, and a call summary. This demonstrates how to use the Speech and Language services to analyze call center conversations. | Explore Demo
Live Chat Avatar | Engage in natural conversations with an avatar that recognizes users' speech input and responds fluently with a realistic AI voice. | Explore Demo
Language Learning | Get instant feedback on pronunciation accuracy, fluency, prosody, grammar, and vocabulary from your chatting experience. | Explore Demo
Video Translation | Translate and generate videos in multiple languages automatically, so you can localize your video content for audiences around the globe. | Explore Demo



Vision Studio

Vision-Based Scenarios

Scenario | Description | Link
Video Retrieval and Summary | Quickly summarize the main points of a video and search for specific moments. | Explore Demo
Customize Models with Images | Find specific objects within images for use cases like product placement and assembly line checks. | Explore Demo
Add Dense Captions to Images | Generate human-readable captions for all important objects detected in your image. | Explore Demo
Remove Backgrounds from Images | Easily remove the background and preserve foreground elements. | Explore Demo
Add Captions to Images | Generate a human-readable sentence that describes the content of an image. | Explore Demo
Detect Common Objects in Images | Detect and extract bounding boxes for recognizable objects and living beings. | Explore Demo
Extract Text from Images | Use OCR to extract printed and handwritten text from images, PDFs, and TIFF files. | Explore Demo
Extract Common Tags from Images | Extract tags based on recognizable objects, scenery, and actions. | Explore Demo
Create Smart-Cropped Images | Automatically crop images to emphasize the most important areas. | Explore Demo
Detect Faces in an Image | Detect the location of human faces and their attributes in images. | Explore Demo
Count People in an Area | Analyze video to count the number of people in a designated zone. | Explore Demo
Detect When People Cross a Line | Detect when a person crosses a line in the camera's field of view. | Explore Demo





Language Studio

Language Processing Scenarios

Scenario | Description | Link
Extract PII | Identify sensitive personally identifiable information (PII) in text. | Explore Demo
Extract Key Phrases | Quickly identify the main points from unstructured text. | Explore Demo
Find Linked Entities | Disambiguate the identity of entities found in text by linking to a knowledge base. | Explore Demo
Extract Named Entities | Identify and categorize entities in text using Named Entity Recognition (NER). | Explore Demo
Extract Health Information | Extract and label medical information from unstructured texts. | Explore Demo
Analyze Sentiment and Opinions | Provide sentiment labels and confidence scores at the sentence and document level. | Explore Demo
Detect Language | Determine the language used in the input document and return a confidence score. | Explore Demo
Custom Text Classification | Create custom text classification projects with labeled data and trained models. | Explore Demo
Answer Questions | Extract answers to questions from passages of text provided. | Explore Demo
Conversational Language Understanding Projects | Build projects with labeled data and trained models for understanding conversational language. | Explore Demo
Orchestration Projects | Build and manage projects that integrate multiple language services. | Explore Demo
Summarize Information | Produce summaries for conversations or documents using summarization APIs. | Explore Demo
Document Translation | Batch translate documents into one or more languages either from local storage or Azure Blob Storage. | Explore Demo
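
Several of the text analysis scenarios above (PII, key phrases, linked entities, named entities, sentiment) share one request shape, where only the task name changes. The sketch below builds that request body as plain JSON. The `kind` values and field names are assumptions based on the Language service's analyze-text REST API, and `build_analyze_text_body` is a hypothetical helper.

```python
import json

# Assumed task names ("kind" values) for the analyze-text API.
TASK_KINDS = {
    "pii": "PiiEntityRecognition",
    "key_phrases": "KeyPhraseExtraction",
    "linked_entities": "EntityLinking",
    "named_entities": "EntityRecognition",
    "sentiment": "SentimentAnalysis",
}


def build_analyze_text_body(task, texts, language="en"):
    """Build the JSON body for one analysis task over a list of texts."""
    documents = [
        {"id": str(index), "language": language, "text": text}
        for index, text in enumerate(texts, start=1)
    ]
    return {"kind": TASK_KINDS[task], "analysisInput": {"documents": documents}}


body = build_analyze_text_body("pii", ["Call me at 555-0100 tomorrow."])
print(json.dumps(body, indent=2))
```

Switching the first argument to `"sentiment"` or `"key_phrases"` reuses the same documents with a different task.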





Document Intelligence

Document Analysis Scenarios

Scenario | Description | Link
Read | Extract printed and handwritten texts along with barcodes and formulas from documents. | Explore Demo
Layout | Extract tables, checkboxes, and text from forms and images. | Explore Demo
General Documents | Extract key-value pairs and structure from any form or document. | Explore Demo

Prebuilt Models Scenarios

Scenario | Description
Invoices | Extract invoice details including customer and vendor details, totals, and line items.
Receipts | Extract transaction details from receipts including date, merchant information, and totals.
Identity Documents | Extract details from passports and ID cards.
US Health Insurance Cards | Extract details from US health insurance cards.
US personal tax | Classify, then extract information from documents containing any number of W-2s, 1040s, 1098s, and 1099s.
US mortgage | Extract information from a variety of mortgage documents.
US pay stubs | Extract employee information and payment information, including earnings, deductions, net pay, and more.
US bank statements | Extract details from US bank statements.
US checks | Extract the amount, date, pay-to-order, MICR numbers, name and address of the payer, and more.
Marriage Certificates | Extract details from marriage certificates.
Credit Cards | Extract details from credit cards including card number and cardholder name.
Contracts | Extract title and signatory parties' information from contracts.
Business Cards | Extract contact details from business cards.
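
Each prebuilt model is normally selected by a model ID in the analyze request. The sketch below maps a few of the scenarios above to model IDs and builds the analyze URL. The IDs, the `formrecognizer` path, and the API version are assumptions to verify against the current Document Intelligence reference; only a subset of the scenarios is covered, and the endpoint is a placeholder.

```python
# Assumed model IDs for a few of the scenarios listed above.
PREBUILT_MODEL_IDS = {
    "Read": "prebuilt-read",
    "Layout": "prebuilt-layout",
    "Invoices": "prebuilt-invoice",
    "Receipts": "prebuilt-receipt",
    "Identity Documents": "prebuilt-idDocument",
}

API_VERSION = "2023-07-31"  # assumption; use the version your resource supports


def analyze_url(endpoint, scenario):
    """Return the URL that would start an analysis with the scenario's model."""
    model_id = PREBUILT_MODEL_IDS[scenario]
    return (
        f"{endpoint.rstrip('/')}/formrecognizer/documentModels/"
        f"{model_id}:analyze?api-version={API_VERSION}"
    )


url = analyze_url("https://<your-resource>.cognitiveservices.azure.com/", "Invoices")
print(url)
```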



Gen-AI Safety Solutions

Safeguard your image content

Scenario | Description | Link
Moderate image content | A tool for evaluating different content moderation scenarios. It takes into account factors such as the type of content, the platform's policies, and the potential impact on users. Run moderation tests on sample content, then use Configure filters to rerun and further fine-tune the test results. Add specific terms that you want to detect and act on to the blocklist. | Explore Demo
Moderate Multimodal content | Run moderation tests on combined image and text content. Assess the test results with detected severities. | Private Preview

Safeguard your Text

Scenario | Description | Link
Moderate text content | Run moderation tests on text content. Assess the test results with detected severities. | Explore Demo
Groundedness Detection | Detect ungrounded content generated by large language models (LLMs). | Private Preview
Protected Material Detection | Detect and protect third-party text material in LLM output. | Explore Demo
Prompt Shield | Prompt Shields provides a unified API that addresses two types of attack: jailbreak attacks and indirect attacks. | Explore Demo

Real-time Safety Measures

Scenario | Description | Link
Monitor Online Activity | Displays your API usage, moderation results, and their distribution per category. You can customize the severity threshold for each category to view the updated results and deploy the new threshold on your end. You can also edit the blocklist on this page to respond to any incidents. | Explore Demo
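
To make the per-category severity threshold idea concrete, here is a minimal sketch of applying thresholds to a moderation result on the client side. The category names and the `category`/`severity` result shape are assumptions modelled on Azure AI Content Safety responses. The sample values are made up, and `apply_thresholds` is a hypothetical helper.

```python
def apply_thresholds(categories_analysis, thresholds, default_threshold=4):
    """Return the categories whose severity meets or exceeds their threshold."""
    flagged = []
    for item in categories_analysis:
        limit = thresholds.get(item["category"], default_threshold)
        if item["severity"] >= limit:
            flagged.append(item["category"])
    return flagged


# Made-up moderation result for illustration only.
sample_result = [
    {"category": "Hate", "severity": 2},
    {"category": "SelfHarm", "severity": 0},
    {"category": "Sexual", "severity": 0},
    {"category": "Violence", "severity": 4},
]

strict_on_hate = apply_thresholds(sample_result, {"Hate": 2, "Violence": 6})
default_only = apply_thresholds(sample_result, {})
print(strict_on_hate)  # ['Hate']
print(default_only)    # ['Violence']
```

Lowering a category's threshold flags more content in that category, and raising it flags less, which is the trade-off the monitoring page lets you explore.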
