Modal
modal.com
Modal llms.txt
Modal is a platform for running Python code in the cloud with minimal configuration, especially for serving AI models and high-performance batch processing. It supports fast prototyping, serverless APIs, scheduled jobs, GPU inference, distributed volumes, and sandboxes.
Important notes:
- Modal's primitives are embedded in Python and tailored for AI/GPU use cases, but they can be used for general-purpose cloud compute.
- Modal is a serverless platform, meaning you are only billed for resources used and can spin up containers on demand in seconds.
You can sign up for free at https://modal.com and get $30/month of credits.
Guide
- Introduction
- Custom container images
- GPUs and other resources
- Scaling out
- Deployment
- Modal Sandboxes
- Modal Notebooks
- Secrets and environment variables
- Scheduling and cron jobs
- Web Functions
- Networking
- Data sharing and storage
- Performance
- Reliability and robustness
- Security and privacy
- Integrations
- Workspace & account settings
- Other topics
- Feature maturity
- JavaScript/Go SDKs
- Modal 1.0 migration guide
- File and project structure
- Developing and debugging
- Developing Modal code with LLMs
- Jupyter notebooks
- Asynchronous API usage
- Global variables
- Region selection
- Container lifecycle hooks
- Parametrized functions
- Dynamic function configuration
- S3 Gateway endpoints
- GPU Metrics
Examples
- Featured
- Getting started
- Large language models (LLMs)
- Deploy an OpenAI-compatible LLM service with vLLM
- Cut Ministral 3 cold start times by 10x with snapshots
- Maximize tokens per second in batch processing with vLLM
- Serve an ultra-low-latency chatbot with SGLang
- Deploy Nemotron 3
- Efficient LLM Finetuning with Unsloth
- Run a multimodal RAG chatbot to answer questions about PDFs
- Fine-tune an LLM to replace your CEO
- Deploy a stateless MCP with FastMCP
- Images, video, & 3D
- Audio
- Real-time communication (WebRTC)
- Computational biology
- Modal Sandboxes
- Run a background coding agent with OpenCode
- Build a scalable AI coding platform
- Create GIFs from Slack using the Claude Agent SDK
- Run a LangGraph agent's code in a secure GPU sandbox
- Control a sandboxed computer with an LLM
- Build a stateful, sandboxed code interpreter
- Run Node.js, Ruby, and more in a Sandbox
- Speed up Sandbox starts with warm pools
- Reinforcement Learning
- Embeddings
- Parallel processing and job scheduling
- Training models from scratch
- Hosting popular libraries
- Connecting to other APIs
- Managing data
- Miscellaneous
API Reference
- Changelog
- Application construction
App: The main unit of deployment for code on Modal
App.function: Decorator for registering a function with an App
App.cls: Decorator for registering a class with an App
- Serverless execution
- Extended Function configuration
- Class parametrization
parameter: Used to define class parameters, akin to a Dataclass field
- Lifecycle hooks
- Web integrations
fastapi_endpoint: Decorator for exposing a simple FastAPI-based endpoint
asgi_app: Decorator for functions that construct an ASGI web application
wsgi_app: Decorator for functions that construct a WSGI web application
web_server: Decorator for functions that construct an HTTP web server
- Function semantics
batched: Decorator that enables dynamic input batching
concurrent: Decorator that enables input concurrency
- Scheduling
- Exception handling
Retries: Function retry policy for input failures
- Sandboxed execution
Sandbox: An interface for restricted code execution
ContainerProcess: An object representing a sandboxed process
FileIO: A handle for a file in the Sandbox filesystem
- Container configuration
- Data primitives
- Persistent storage
Volume: Distributed storage supporting highly performant parallel reads
CloudBucketMount: Storage backed by a third-party cloud bucket (S3, etc.)
NetworkFileSystem: Shared, writeable cloud storage (superseded by modal.Volume)
- In-memory storage
- Networking