Serverless is a pay-per-second computing solution designed for dynamic autoscaling in production environments. It automatically adjusts computational resources based on your request traffic, ensuring cost-effective usage. We offer both GPU and CPU serverless options:
GPU Serverless: Each worker is equipped with a dedicated GPU, ideal for AI/ML workloads.
CPU Serverless: Workers come with high-clock-speed CPU cores, suited for general-purpose workloads.
A worker is a single compute resource that processes Serverless endpoint requests. Each endpoint can have multiple workers, enabling parallel processing of multiple requests simultaneously.
An endpoint refers to a specific REST API (URL) provided by Runpod that your applications or services can interact with. These endpoints enable standard functionality for submitting jobs and retrieving their outputs.
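As a rough illustration of submitting a job and retrieving its output over REST, the sketch below only builds the two HTTP requests and does not send them. The base URL, the `/run` and `/status/{job_id}` paths, the bearer-token header, and the `{"input": ...}` body shape are assumptions made for this example, and the helper names `build_submit_request` and `build_status_request` are hypothetical; check them against your endpoint's documentation before relying on them.

```python
import json
import urllib.request

# Assumed base URL for illustration; confirm it for your own endpoint.
API_BASE = "https://api.runpod.ai/v2"


def build_submit_request(endpoint_id: str, api_key: str, job_input: dict) -> urllib.request.Request:
    """Build (but do not send) a request that submits a job to an endpoint."""
    # Assumed body shape: the job payload is wrapped under an "input" key.
    body = json.dumps({"input": job_input}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/{endpoint_id}/run",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def build_status_request(endpoint_id: str, api_key: str, job_id: str) -> urllib.request.Request:
    """Build (but do not send) a request that fetches a job's status and output."""
    return urllib.request.Request(
        f"{API_BASE}/{endpoint_id}/status/{job_id}",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )
```

Passing either request to `urllib.request.urlopen` would perform the actual call. Keeping request construction separate from sending makes the URL and payload easy to inspect first.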
A handler is a function you create that takes in submitted inputs, processes them (like generating images, text, or audio), and returns the final output.
The Serverless SDK is a Python package used when creating a handler function. This package receives requests from our Serverless system, runs your handler function, and returns the function’s result to the Serverless system.
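A minimal sketch of a handler follows. It assumes the submitted inputs arrive as a dictionary under an `"input"` key, and the `"text"` field and the returned keys are hypothetical, chosen only for illustration. Registering the handler with the SDK is left as a comment because the exact call is an assumption here and should be confirmed in the SDK documentation.

```python
def handler(job: dict) -> dict:
    """Take the submitted inputs, process them, and return the final output."""
    # Assumed job shape: {"id": "...", "input": {...}}.
    job_input = job.get("input", {})
    text = job_input.get("text", "")  # "text" is a hypothetical input field

    if not isinstance(text, str) or not text:
        # Report a problem with the input instead of raising.
        return {"error": "expected a non-empty string in input['text']"}

    # Stand-in for real work such as generating images, text, or audio.
    return {"output": text.upper(), "length": len(text)}


# Assumption: with the SDK installed, the handler would be registered
# along these lines so the Serverless system can invoke it:
#
#   import runpod
#   runpod.serverless.start({"handler": handler})
```

Because the handler is an ordinary function, it can be exercised locally by calling it with a sample job, for example `handler({"input": {"text": "hello"}})`.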
A data center is a secure location where Runpod’s cloud computing services, such as GPU instances and storage instances, are hosted. These data centers are equipped with redundant power, multiple ISP connections, and data backups to ensure the safety and reliability of your compute services and data.
A GPU instance is a container-based compute resource that you can deploy. These instances spin up in seconds using both public and private repositories. They are available in two different types:
A Runpod template is a Docker container image paired with a configuration. Templates define the required container disk size, volume, volume paths, ports, and environment variables needed to launch Pods. There are three types of templates:
Official templates: Created and maintained by Runpod with full support.
Community templates: User-contributed templates shared by the community. Runpod does not maintain these or provide customer support for them.
Private templates: Custom templates created by you or your team.
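To make the pieces of a template concrete, the sketch below models one as a small data class: a container image plus the configuration items listed above. The class name `TemplateSketch`, every field name, the default values, and the sample port string are hypothetical and do not reflect Runpod's actual template schema.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# The three template types described above.
TEMPLATE_KINDS = ("official", "community", "private")


@dataclass
class TemplateSketch:
    """Hypothetical model of a template: an image paired with a configuration."""

    image: str                            # Docker container image reference
    container_disk_gb: int                # required container disk size
    volume_gb: int = 0                    # volume size; 0 means no volume
    volume_mount_path: str = "/workspace"  # illustrative volume path
    ports: List[str] = field(default_factory=list)
    env: Dict[str, str] = field(default_factory=dict)
    kind: str = "private"

    def __post_init__(self) -> None:
        # Reject anything other than the three known template types.
        if self.kind not in TEMPLATE_KINDS:
            raise ValueError(f"kind must be one of {TEMPLATE_KINDS}")
        if self.container_disk_gb <= 0:
            raise ValueError("container_disk_gb must be positive")


example = TemplateSketch(
    image="example/image:latest",
    container_disk_gb=20,
    ports=["8888/http"],
    env={"MODE": "dev"},
)
```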
Runpod provides several Software Development Kits (SDKs) you can use to interact with the Runpod platform. These SDKs enable you to create serverless functions, manage infrastructure, and interact with APIs.