Latency and Cloud: Why Hosting Near the User Still Matters

Latency often appears late in infrastructure discussions. First, the focus is on price, capacity, cloud region, scalability, security, or managed services. Only after the application is in production do metrics arrive: pages taking longer to respond, abandoned carts, waiting APIs, unconverted mobile sessions, and users perceiving the service as slow even though the server “works fine.”

In markets like Argentina, this discussion has a very concrete interpretation. Hosting an application from local infrastructure is not the same as doing so from São Paulo, Miami, Virginia, Madrid, or Frankfurt. The difference may seem small when expressed in milliseconds, but it multiplies with each request, each TCP/TLS negotiation, API call, or dynamic resource that can’t be cached simply.

RTT is not a detail: it’s the distance turned into experience

When talking about network latency, RTT, or round-trip time, is usually used. It measures how long it takes for a packet to go from one point to another and back. It’s a simple metric but very useful to understand why physical infrastructure location still matters. Fiber optics don’t eliminate geography. They reduce, route, and optimize it, but each hop, international route, and exchange point adds time.

In a modern web application, a 100-millisecond difference doesn’t just affect a single request. It can involve DNS resolution, connection establishment, TLS handshake, initial HTML load, backend calls, blocking resources, payment endpoints, login, or microservice communication if the system is distributed across regions.

The following table uses Buenos Aires as a reference point and summarizes approximate RTT values published by public measurements between cities. These figures should not be read as performance guarantees, since each operator, route, transit provider, and time of day can change the result. They serve to visualize the order of magnitude.

Infrastructure LocationApproximate RTT from Buenos AiresDifference from localTechnical note
Argentina local~2-5 msReferenceAdequate for latency-sensitive loads and national users
Santiago de Chile~21-25 ms+20 ms approx.Good regional option if connectivity is sufficient
São Paulo~30-35 ms+30 ms approx.Nearby cloud region for many South American deployments
Miami~110-135 ms+110-130 msCommon link to US, but far for local experience
New York / Virginia~140-150 ms+140-150 msFrequent default region in many cloud architectures
London~220-225 ms+220 ms approx.Strong for Europe, less efficient for Argentine users
Madrid~230-235 ms+230 ms approx.Good for Spain, not ideal for low latency in Argentina
Frankfurt~235-245 ms+235-240 msMature European region, but very distant for Argentine traffic

Madrid is a good example to avoid a common misconception. As a European region, it can be an excellent location to serve users in Spain, Portugal, Southern Europe, or certain transatlantic interconnection scenarios. But if the end user is in Buenos Aires, Córdoba, Rosario, or Mendoza, Madrid is “not close.” Every interaction crosses the Atlantic, and this is reflected in RTT.

The technical cost of placing the server far away

The economic impact of latency has been often cited with two well-known references. Greg Linden, a former Amazon engineer, explained that internal tests introduced delays of 100 milliseconds, and even small delays caused significant revenue drops. The phrase has been summarized over the years as “an extra 100 ms can cost 1% of sales.” It’s useful to see this as a historical reference, not a universal formula.

The second reference comes from the study “Milliseconds Make Millions,” conducted by Deloitte for Google. The report observed that a 0.1-second improvement in various mobile speed metrics was associated with an 8.4% increase in retail conversions and 10.1% in travel. Again, this doesn’t guarantee that any web will replicate those numbers, but it confirms what performance teams have been seeing: when mobile experience improves, conversion tends to follow.

Applied to an e-commerce business with $1 million annual digital revenue, the simplified calculation based on Amazon’s reference would be:

LocationApproximate additional latency compared to localEstimated annual impact on $1M*
Argentina local0 msReference
Santiago de Chile+20 ms~$2,000
São Paulo+30 ms~$3,000
Miami+130 ms~$13,000
New York / Virginia+145 ms~$14,500
London+220 ms~$22,000
Madrid+230 ms~$23,000
Frankfurt+240 ms~$24,000

*Estimate based on the historical rule: 100 ms additional ≈ 1% less sales. Does not replace actual measurements with your own metrics.

The exact figure can vary greatly. It’s different for a site with massive mobile traffic than for a corporate portal, a financial API, a B2B SaaS, or a gaming platform. But the principle remains: the more interactive and transactional an application is, the more expensive it becomes to keep the backend close to the user.

Different problems: local, edge, CDN, and regional cloud do not solve the same issues

A CDN can clearly improve static content delivery: images, CSS, JavaScript, videos, fonts, or downloads. It can also reduce origin load and improve response times on cacheable resources. But not everything can be cached. Login, stock queries, payment operations, cart updates, personalized recommendations, or database calls need to reach the backend.

That’s where infrastructure location becomes central again. A design with frontend cached at the edge but dynamic backend in Virginia can deliver an acceptable initial load but still penalize each relevant user action. On mobile, this situation worsens due to network variability, packet loss, cell changes, and less stable connections.

LayerWhat improvesWhat it doesn’t solve alone
CDNStatic resources, cache, downloads, video, imagesDynamic, non-cacheable operations
Edge functionsLight logic near the userComplex databases or transactional state far away
Nearby cloud regionLower RTT than distant regionsDependency on services available in that region
Local infrastructureLow latency for domestic usersRequires good connectivity, operation, and resilience design
Multi-regionResilience and proximity per marketIncreased data complexity, consistency, and cost

In Argentina, there’s an additional peculiarity: major hyperscalers do not show a complete public region in the country on their official maps. AWS lists a Local Zone in Buenos Aires, associated with the primary US East (N. Virginia) region, allowing certain resources to be closer to end users. But a Local Zone is not the same as a full region with the complete catalog of services. Azure, Google Cloud, and Oracle publish global region maps where coverage in Latin America is organized around other countries and locations.

Therefore, the decision should not be framed as “public cloud or local infrastructure,” but as a hybrid and realistic architecture. Some workloads fit well into global cloud regions: analytics, backup, disaster recovery, managed services, AI, development environments, or applications with distributed users. Others, especially those sensitive to latency and with users concentrated in a single country, can benefit from a local placement.

How to measure before migrating a workload

Latency shouldn’t be estimated solely from a table. Before migrating, it’s essential to measure from the points where actual users are located. For an Argentine ecommerce, that means testing from Buenos Aires, Córdoba, Rosario, Mendoza, Tucumán, and other relevant zones. For a B2B SaaS, it also matters which corporate networks clients connect from. For a financial API, measurements should include the app, PSPs, banks, anti-fraud services, and third-party providers.

A useful test should include, at minimum, RTT, jitter, packet loss, TTFB, p95 and p99 response times, TLS handshake, database query times, and mobile device performance. The average can be misleading. An app may have an acceptable average latency but suffer from 99th percentile queues that ruin the experience during critical moments.

It’s also important to differentiate processing latency from network latency. A slow server in Argentina may perform worse than an optimized backend in São Paulo. But when both are well designed, geography matters. No CPU tuning can fully compensate for a RTT of 230 ms if the user is in Buenos Aires and the main backend is in Madrid.

The technical conclusion is simple: latency doesn’t show up as a line item on the provider invoice, but it appears in metrics. It influences conversions, abandonment, TTFB, API response times, payment processes, and perceived quality. For many companies, choosing where an application runs can be as critical as choosing the platform itself.

Frequently Asked Questions

What is network latency?
It’s the time for data to travel between the user and the server. Usually measured as RTT, the round-trip time of a packet.

Why is it so important in ecommerce?
Because each delay can affect page load, search, cart, login, and payment. On mobile, small delays tend to be more noticeable due to network variability.

Does a CDN eliminate the latency problem?
Not entirely. It helps greatly with static content and cacheable resources but dynamic operations still depend on where the backend, database, and APIs are located.

Is Madrid a good location to serve users in Argentina?
Madrid can be excellent for Spain and Europe, but from Argentina, RTT is roughly over 230 ms. For latency-sensitive applications, a local or regional placement is usually more suitable.

Scroll to Top