Why Amazon EC2 Inf1 Instances?
Businesses across a diverse set of industries are looking at artificial intelligence (AI)–powered transformation to drive business innovation and improve customer experience and process improvements. Machine learning (ML) models that power AI applications are becoming increasingly complex, resulting in rising underlying compute infrastructure costs. Up to 90% of the infrastructure spend for developing and running ML applications is often on inference. Customers are looking for cost-effective infrastructure solutions for deploying their ML applications in production.
Amazon EC2 Inf1 instances deliver high-performance and low-cost ML inference. They deliver up to 2.3x higher throughput and up to 70% lower cost per inference than comparable Amazon EC2 instances. Inf1 instances are built from the ground up to support ML inference applications. They feature up to 16 AWS Inferentia chips, high-performance ML inference chips designed and built by AWS. Additionally, Inf1 instances include 2nd Generation Intel Xeon Scalable processors and up to 100 Gbps networking to deliver high throughput inference.
Customers can use Inf1 instances to run large-scale ML inference applications such as search, recommendation engines, computer vision, speech recognition, natural language processing (NLP), personalization, and fraud detection.
Developers can deploy their ML models to Inf1 instances by using the AWS Neuron SDK, which is integrated with popular ML frameworks such as TensorFlow, PyTorch, and Apache MXNet. They can continue using the same ML workflows and seamlessly migrate applications onto Inf1 instances with minimal code changes and with no tie-in to vendor-specific solutions.
Get started easily with Inf1 instances using Amazon SageMaker, AWS Deep Learning AMIs (DLAMI) that come preconfigured with Neuron SDK, or Amazon Elastic Container Service (Amazon ECS) or Amazon Elastic Kubernetes Service (Amazon EKS) for containerized ML applications.
Amazon EC2 Inf1 Instances
Benefits
Features
Customer and Partner testimonials
Here are some examples of how customers and partners have achieved their business goals with Amazon EC2 Inf1 instances.
-
Snap Inc.
-
Sprinklr
-
Finch Computing
-
Dataminr
-
Autodesk
-
Screening Eagle Technologies
-
NTT PC Communications
NTT PC Communications, a network service and communication solution provider in Japan, is a telco leader in introducing new innovative products in the information and communication technology market.
-
Anthem
Anthem is one of the nation's leading health benefits companies, serving the healthcare needs of 40+ million members across dozens of states.
-
Condé Nast
-
Ciao Inc.
-
The Asahi Shimbun Company
-
CS Disco
-
Talroo
-
Digital Media Professionals
-
Hotpot.ai
Hotpot.ai empowers non-designers to create attractive graphics and helps professional designers automate rote tasks.
-
SkyWatch
-
Money Forward Inc.
Money Forward Inc. serves businesses and individuals with an open and fair financial platform. As part of this platform, HiTTO Inc., a Money Forward group company, offers an AI chatbot service that uses tailored NLP models to address the diverse needs of their corporate customers.
-
Amazon Advertising
Amazon Advertising helps businesses of all sizes connect with customers at every stage of their shopping journey. Millions of ads, including text and images, are moderated, classified, and served for the optimal customer experience every single day.
Read the news blog -
Amazon Alexa
-
Amazon Prime Video
-
Amazon Rekognition and Video
Product details
* Prices shown are for US East (Northern Virginia) AWS Region. Prices for 1-year and 3-year reserved instances are for "Partial Upfront" payment options or "No Upfront" for instances without the Partial Upfront option.
Amazon EC2 Inf1 instances are available in the US East (N. Virginia), US West (Oregon) AWS Regions as On-Demand, Reserved, or Spot Instances.