![Serve 3,000 deep learning models on Amazon EKS with AWS Inferentia for under $50 an hour | AWS Machine Learning Blog Serve 3,000 deep learning models on Amazon EKS with AWS Inferentia for under $50 an hour | AWS Machine Learning Blog](https://d2908q01vomqb2.cloudfront.net/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59/2021/09/13/ML5291-archdiag.png)
Serve 3,000 deep learning models on Amazon EKS with AWS Inferentia for under $50 an hour | AWS Machine Learning Blog
![GPU-Accelerated Amazon Web Services | Boost Performance and Scale Deep Learning and HPC Applications. GPU-Accelerated Amazon Web Services | Boost Performance and Scale Deep Learning and HPC Applications.](https://www.nvidia.com/content/dam/en-zz/Solutions/Data-Center/gpu-cloud-computing/amazon-data-center-gpu-cloud-social-media.jpg)
GPU-Accelerated Amazon Web Services | Boost Performance and Scale Deep Learning and HPC Applications.
![GPU-Accelerated Amazon Web Services | Boost Performance and Scale Deep Learning and HPC Applications. GPU-Accelerated Amazon Web Services | Boost Performance and Scale Deep Learning and HPC Applications.](https://www.nvidia.com/content/dam/en-zz/Solutions/Data-Center/gpu-cloud-computing/google-cloud-platform/nvidia-csp-partner-google-cloud/nvidia-tesla-v100-3c33-p@2x.png)
GPU-Accelerated Amazon Web Services | Boost Performance and Scale Deep Learning and HPC Applications.
![Develop, Deploy, and Distribute Immersive Experiences with NVIDIA CloudXR and Amazon Web Services | NVIDIA Technical Blog Develop, Deploy, and Distribute Immersive Experiences with NVIDIA CloudXR and Amazon Web Services | NVIDIA Technical Blog](https://developer-blogs.nvidia.com/wp-content/uploads/2021/12/AWS-architecture.png)