Fal.ai 🇺🇸

Serverless platform for running AI models

Founded in 2021, Fal.ai is a cloud platform for deploying AI models with a focus on inference for generative content. It allows developers to run and fine-tune models without managing complex infrastructure.

Example customers include PlayAI, Quora Poe, Genspark, and Hedra.

Fal.ai Homepage

What's good about Fal.ai

  • Optimized for fast inference, especially for generative media
  • Cost-effective, pay-as-you-go pricing model
  • Offers serverless GPU instances

Fal.ai pricing examples

Fal.ai uses a usage-based pricing model, ensuring you only pay for the compute you consume. It offers two main structures:

  • GPU Pricing: Billed per second for deploying custom applications on their GPU fleet.
  • Output-Based Pricing: For models hosted by Fal.ai, billing is based on the output generated, such as per image, per megapixel, or per second of video.

GPUs are available in the following configurations:

Name   GPUs      VRAM   vCPUs  RAM  Price per hour
A6000  1x A6000  48GB   --     --   $0.60
A100   1x A100   40GB   --     --   $0.99
H100   1x H100   80GB   --     --   $1.89
H200   1x H200   141GB  --     --   $2.10
B200   1x B200   184GB  --     --   On request
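
Because GPU usage is billed per second while the rates above are quoted per hour, a short calculation helps translate one into the other. The following is a minimal sketch, assuming the hourly rate is prorated evenly per second with no minimum charge or rounding (the listing does not confirm either). The function name `estimate_gpu_cost` and the rate table are illustrative, not part of any Fal.ai API.

```python
# Hourly rates in USD, copied from the table above.
# B200 is priced on request, so it is omitted here.
HOURLY_RATE_USD = {
    "A6000": 0.60,
    "A100": 0.99,
    "H100": 1.89,
    "H200": 2.10,
}


def estimate_gpu_cost(gpu: str, seconds: float) -> float:
    """Estimate cost in USD for running `gpu` for `seconds`.

    Assumption: the hourly rate is prorated evenly per second,
    with no minimum billing increment.
    """
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    per_second = HOURLY_RATE_USD[gpu] / 3600
    return per_second * seconds


if __name__ == "__main__":
    # Compare a 90-second job across the listed GPUs.
    for gpu in HOURLY_RATE_USD:
        print(f"{gpu}: ${estimate_gpu_cost(gpu, 90):.4f} for 90 s")
```

Under these assumptions, a full hour on an H100 comes out to the listed $1.89, and shorter jobs scale down proportionally.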

Note: Our pricing examples are based on several assumptions. Your actual costs may differ. Always check the cloud provider's website for the most up-to-date pricing; Fal.ai publishes its latest pricing on its own site.

Which services does Fal.ai offer