Does Thunder Compute charge for egress when downloading generated images?

No, egress is free on Thunder Compute as of May 2026.

Is Thunder Compute A100 80GB available 24/7?

Thunder Compute provides Datacenter GPU access with generally high availability, though individual instances may need to be restarted if hardware fails.

How does Thunder Compute compare to Vast.ai for A100 80GB pricing?

Both Thunder Compute and Vast.ai offer GPU compute marketplaces. Thunder Compute currently lists the A100 80GB at $0.780/hr. Vast.ai shows RTX 4090 at around $0.67/hr average (range $0.36-$1.20). Prices fluctuate based on supply.

Can I run ComfyUI on Thunder Compute A100 80GB?

Yes. The A100 80GB with 80 GB VRAM supports all standard ComfyUI workflows including Flux Dev, Flux Schnell, SDXL, and most ControlNet pipelines. Provision the instance, install ComfyUI and dependencies, and expose the API port.

What VRAM does the A100 80GB have on Thunder Compute?

The A100 80GB on Thunder Compute has 80 GB of VRAM. This is sufficient for Flux Dev (requires ~16-24 GB), SDXL (requires ~8-12 GB), and most ComfyUI workflows.

Thunder Compute A100 80GB Pricing 2026: Real Cost Breakdown

Q: What is the estimated cost per image on Thunder Compute A100 80GB for Flux Dev?

At $0.780/hr and approximately 950 Flux Dev images/hr throughput, the estimated cost is $0.00082/image. This is an estimate; actual throughput depends on your workflow, batch size, and resolution.

Q: What is the estimated cost per image on Thunder Compute A100 80GB for Flux Schnell?

At $0.780/hr and approximately 6,000 Flux Schnell images/hr, the estimated cost is $0.00013/image. This is an estimate.

Thunder Compute rents A100 80GB GPUs at $0.780/hr (Datacenter) as of May 2026. The A100 80GB has 80 GB of VRAM, which is sufficient for running Flux Dev, Flux Schnell, SDXL, and most production diffusion workloads. This page covers the exact pricing structure, estimated cost per generated image for common AI models, how Thunder Compute compares to alternatives for the same GPU, and when this combination makes sense for your workload.

Thunder Compute A100 80GB: Current Price Per Hour

Thunder Compute charges $0.780/hr for the A100 80GB on its Datacenter tier. Thunder Compute uses a virtualised GPU technology that allows multiple workloads to share physical hardware. As of May 2026 it offers the lowest published H100 rate of any provider. Verify CUDA workload compatibility before committing.

A100 80GB pricing across providers - verified May 2026

Provider	Price/hr	Tier	Notes
TensorDock	$0.750	Marketplace	-
Thunder Compute	$0.780	Datacenter	Virtualized GPU technology - check compatibility w
Vast.ai	$0.901	Marketplace	-
RunPod	$1.19	Community	-
Crusoe	$1.80	Datacenter	Clean energy (stranded natural gas). Good for ESG-
Modal	$2.10	Serverless	Billed per second, no idle costs
CoreWeave	$2.70	Enterprise	Approximate - CoreWeave pricing requires account q
Lambda	$2.79	Datacenter	Price is per GPU in 8x A100 instance ($22.32/hr to

$0.780

Thunder Compute A100 80GB price per hour (Datacenter) - verified May 2026

https://www.thundercompute.com/pricing

Pricing Tiers and Instance Options on Thunder Compute

Thunder Compute operates a datacenter model for GPU compute. The $0.780/hr rate is the standard Datacenter price. Notes from the provider: Virtualized GPU technology - check compatibility with CUDA workloads The Datacenter tier suits most AI image generation workloads. For compliance-sensitive applications (healthcare data, financial records), datacenter-grade infrastructure with SLA guarantees is preferable over community hardware.

Monthly cost projections for the A100 80GB on Thunder Compute: 8 hrs/day (business hours only) = $187.20/month; 12 hrs/day = $280.80/month; 24 hrs/day (always-on) = $561.60/month. For batch inference jobs that run only while processing, the always-on figure is the ceiling, not the expected cost. A 10,000-image batch on this GPU takes approximately 10 hours for Flux Dev and 1 hours for Flux Schnell.

Estimated Cost Per Image: Flux Models on Thunder Compute A100 80GB

GPU rental providers charge by the hour, not by the image. To estimate per-image cost, divide the hourly rate by images generated per hour. The throughput figures below are estimates based on typical single-GPU performance at default batch size and resolution (1024x1024). Actual throughput varies by ComfyUI workflow complexity, LoRA count, and batch size.

Estimated cost per image on Thunder Compute A100 80GB at $0.780/hr - May 2026

Model	Est. throughput	Est. cost/image	Monthly cost at 10K images
Flux Schnell	~6,000 images/hr	~$0.00013	$1.30
Flux Dev	~950 images/hr	~$0.00082	$8.21
SDXL	~1,200 images/hr	~$0.00065	$6.50

These are estimates. Run your specific workflow on the GPU for 30 minutes and measure actual throughput before committing to long-running batch jobs. Throughput scales near-linearly with batch size up to the VRAM limit (80 GB on this GPU).

Thunder Compute A100 80GB vs Alternative Providers

The table above shows Thunder Compute against every provider that offers the A100 80GB. At $0.780/hr, Thunder Compute is one of the cheaper the options for the A100 80GB as of May 2026. The cheapest is TensorDock at $0.750/hr. Price is not the only factor: uptime guarantees, spot availability, and egress costs all affect the total cost of ownership.

For workloads that tolerate interruptions (batch jobs with retry logic), marketplace and community providers offer significant savings over datacenter providers. For latency-sensitive production APIs where a failed GPU instance directly affects users, pay the premium for a provider with uptime SLAs and guaranteed hardware availability.

Spot Pricing and Instance Availability on Thunder Compute

Thunder Compute does not use a traditional spot market. Pricing is fixed at $0.780/hr with no interruptible tier.

For AI image generation workloads, instance availability on Thunder Compute varies by GPU model and time of day. The A100 80GB is less commonly available than RTX 4090 instances on most platforms. Secure-tier and datacenter-grade A100 80GB instances are scarcer than community-tier equivalents. For large batch jobs, provision the instance during off-peak hours (early morning UTC) when availability tends to be higher.

Storage and Hidden Costs on Thunder Compute

The $0.780/hr rate covers GPU compute only. Additional costs to consider: storage (typically $0.01-$0.10/GB/month for model weights and outputs), egress (free on {provider} as of May 2026), and CPU/RAM for the container running your inference server. Model weights for Flux Dev are approximately 24 GB; SDXL is approximately 7 GB. If you store these on persistent storage and mount them at runtime, you pay for storage continuously but avoid re-downloading weights on each run.

A minimal production cost estimate for Thunder Compute A100 80GB: GPU compute at $0.780/hr plus roughly $20-50/month in storage for model weights. Egress is free, so transferring generated images out costs nothing on Thunder Compute. Budget accordingly if you are comparing total cost of ownership against managed API providers.

Networking costs are often overlooked when estimating self-hosted GPU costs. If your application servers are in a different provider or region than your GPU instances, you pay egress fees from both sides of the connection. Colocating your inference server and your application layer in the same provider and region eliminates these fees and also reduces latency by 20-40 ms compared to cross-provider calls. For latency-sensitive applications (real-time image generation with user-facing UX), network topology is as important as GPU speed.

When Thunder Compute A100 80GB Is the Right Choice

Thunder Compute A100 80GB is the right choice when you need direct GPU access for a custom model, a fine-tuned LoRA, or a ComfyUI workflow that a managed API does not support. It is also cost-effective at high volume: the estimated Flux Dev cost of $0.00082/image on this GPU is significantly cheaper than managed APIs at $0.0250/image (fal.ai/Replicate) or $0.0154/image (Together AI).

Thunder Compute A100 80GB is less suitable when you need zero-infrastructure operations or when your team does not have DevOps capacity to manage GPU infrastructure. The engineering cost of running a reliable GPU cluster (monitoring, autoscaling, driver updates, model management) is typically $8,000-$12,000/month for a two-engineer team. Factor this into your total cost comparison when evaluating self-hosted versus managed API.

Teams that get the most value from Thunder Compute A100 80GB share a common profile: they have an existing Python-based ML stack, at least one engineer who is comfortable managing Linux servers and CUDA environments, and workloads that benefit from custom model configurations unavailable on managed platforms. If your team is primarily a product or application team rather than an ML infrastructure team, a managed inference API is almost always the better starting point. You can always migrate to self-hosted GPU once volume justifies the infrastructure investment.

How to Minimise Your Thunder Compute A100 80GB Bill

Three tactics reduce cost when renting A100 80GB time on Thunder Compute: use spot or community instances for batch jobs, maximise GPU utilisation during paid time, and choose the right GPU for your model. For Flux Schnell workloads, the A100 80GB is well-matched. For SDXL or Flux Dev with LoRAs, measure whether an A100 (higher throughput) is more cost-effective than an RTX 4090 for your specific batch size.

Batching images increases GPU utilisation and throughput. A batch of 4 images on the A100 80GB takes only slightly longer than a batch of 1, so effective throughput per hour is higher when processing in bulk. For always-on production APIs with variable traffic, consider a hybrid architecture: a managed inference API for low-traffic periods (paying only per image) and a self-hosted A100 80GB for high-traffic windows (paying fixed hourly rate).

Model weight caching is one of the highest-impact optimisations for GPU rental cost. Load model weights into GPU memory at instance startup and keep the inference server running between requests rather than loading weights per job. For Flux Dev, cold weight loading takes 30-60 seconds on the A100 80GB; a warm server responds in under 2 seconds. At 100 requests per hour, cold-loading per request wastes more than 50 minutes of GPU time per hour, directly multiplying your effective cost per image.

For batch processing workflows, a queue-based architecture eliminates idle GPU time entirely. A message queue (Redis, SQS, or a simple database table) holds pending jobs. A worker script rents a A100 80GB on Thunder Compute, drains the queue until empty, then terminates the instance automatically. This pattern is the most cost-effective for workloads that do not require constant uptime. At $0.780/hr, even 2 hours of idle time per day accumulates to $46.80/month in wasted compute.

When comparing providers for your next GPU rental decision, measure total cost of ownership rather than hourly rate alone. A provider with free egress and reliable availability may cost less in practice than a cheaper hourly rate with high egress fees or frequent instance interruptions. Run a 24-hour benchmark with your actual workflow on two providers before committing to long-running batch work. Effective throughput and instance reliability vary significantly between providers even for the same GPU model.

Final recommendation: Thunder Compute A100 80GB at $0.780/hr suits teams that need direct GPU access, custom model configurations, or cost savings at high volume. For Flux Dev at the estimated $0.00082/image, the savings versus managed APIs ($0.025/image on Replicate/fal.ai) are significant at any volume above 5,000 images per month. The trade-off is infrastructure complexity: allocate engineering time for deployment, monitoring, and maintenance, or factor in a managed orchestration layer like Runflow that handles GPU lifecycle management on top of commodity GPU providers. The right choice depends on your team capacity and whether GPU infrastructure is a core competency or a cost centre you want to minimise.