As artificial intelligence (AI) continues to advance, the demand for high-performance hardware is skyrocketing. Organizations are finding it increasingly difficult to keep pace with the computing power required to run complex AI models and workloads. This is where GPU as a Service (GPUaaS) comes in.
By offering on-demand access to powerful graphics processing units (GPUs) over the cloud, GPUaaS is transforming the way businesses approach AI infrastructure. It eliminates the need for costly hardware investments, allows seamless scaling, and integrates smoothly with existing cloud services—all while simplifying operations. But how exactly does GPUaaS work, and why is it becoming the go-to solution for AI-driven organizations?
Unlocking AI potential in ASEAN
In the Association of Southeast Asian Nations (ASEAN), the GPUaaS market is expanding as more players enter the space to address specific regional challenges. One key factor driving this growth is language. Open-source large language models (LLMs) are predominantly trained in English, and they often struggle with local languages that are rich in cultural nuances. As a result, organizations need to re-train or fine-tune these models with local data to ensure more accurate and relevant responses in native languages.
At the same time, the benefits of using GPUaaS are helping fuel its adoption. Scalability allows users to adjust GPU resources easily as project needs change, while elasticity, through a pay-per-use model, enables organizations to reduce overall expenses by paying only for what they use. GPUaaS also grants immediate access to cutting-edge technology, facilitating rapid prototyping and deployment that increases flexibility and reduces time to market.
Another important consideration is data gravity, residency, and sovereignty. Data gravity refers to the tendency of data to attract applications and services to its location for better performance and efficiency. In many cases, data must reside in specific locations due to residency and sovereignty regulations, which means that GPUaaS providers need to be located near their user bases. Sovereign AI, which emphasizes a nation's ability to develop AI using its own infrastructure, data, and resources, also plays a significant role in shaping the demand for localized GPUaaS.
Lastly, the cost and limited supply of GPUs within cloud service providers (CSPs) in ASEAN are also key factors in GPUaaS adoption. According to a recent Dell report, on-premises AI deployments can yield up to 75% savings compared to CSP-based solutions. GPUaaS offers a cost-efficient alternative, allowing organizations to access high-performance GPUs without a significant upfront hardware investment, making it an attractive option for those seeking to scale their AI capabilities in the region.
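As a rough illustration of that trade-off, the sketch below compares the cumulative cost of renting GPU hours on a pay-per-use basis against an upfront hardware purchase. All prices, opex figures, and utilization numbers are hypothetical placeholders, not quotes from any provider or from the Dell report.

```python
# Hypothetical break-even comparison between renting GPU hours (GPUaaS)
# and buying hardware upfront. All figures are illustrative placeholders.

HOURLY_RENTAL_RATE = 2.50           # assumed USD per GPU-hour from a GPUaaS provider
UPFRONT_HARDWARE_COST = 30_000      # assumed USD per GPU purchased on-premises
MONTHLY_OPEX_PER_GPU = 300          # assumed power, cooling, and maintenance per month
UTILIZATION_HOURS_PER_MONTH = 200   # assumed busy GPU-hours per month

def rental_cost(months: int) -> float:
    """Cumulative cost when paying only for the hours actually used."""
    return months * UTILIZATION_HOURS_PER_MONTH * HOURLY_RENTAL_RATE

def ownership_cost(months: int) -> float:
    """Cumulative cost of buying the GPU and operating it on-premises."""
    return UPFRONT_HARDWARE_COST + months * MONTHLY_OPEX_PER_GPU

if __name__ == "__main__":
    for months in (6, 12, 24, 36):
        print(f"{months:>2} months: rent ${rental_cost(months):>9,.0f} "
              f"vs own ${ownership_cost(months):>9,.0f}")
```

At low or bursty utilization the pay-per-use line stays well below the ownership line; the crossover point depends entirely on how heavily and how long the GPUs are used.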
Balancing the benefits and risks of GPUaaS
While the benefits of GPUaaS help drive its widespread adoption, they also bring their own set of concerns. One key issue is data security, as data transmitted to and from GPUs can be vulnerable to interception or unauthorized access. Furthermore, processing data on remote GPUs may involve navigating varying data protection regulations and compliance requirements. Another concern is performance: reliance on Internet or private connectivity, together with fluctuating GPU performance, can affect application speed and responsiveness. GPUaaS depends on stable, high-speed connections, often favoring private networks over the public Internet for optimal performance.
How does F5 help?
F5 offers innovative, multicloud SaaS-based networking, traffic optimization, and security services for public and private clouds, including GPUaaS providers, through a single console.
By forming an encrypted mesh fabric overlay on top of any network, organizations can connect to a GPUaaS provider (AI factory) for AI inference, embedding, or training. With full network and application segmentation, all overlay connectivity is private and secure, built on top of an existing network underlay. In addition, the F5 encrypted mesh fabric addresses digital resiliency challenges by dynamically monitoring, detecting, optimizing, and delivering traffic to healthy AI components, ensuring your AI applications are always up and available.
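As a simplified picture of that behavior, the sketch below probes a set of AI inference endpoints and steers traffic only to one that responds as healthy. The endpoint URLs and the plain HTTPS health probe are illustrative assumptions; a real encrypted mesh fabric handles this monitoring and steering automatically.

```python
# Minimal sketch of health-aware traffic steering across AI inference
# endpoints reachable over an encrypted overlay. Endpoint URLs are
# hypothetical placeholders.
import urllib.request

# Hypothetical inference endpoints exposed by GPUaaS providers via the overlay.
AI_ENDPOINTS = [
    "https://gpuaas-primary.example.internal/v1/health",
    "https://gpuaas-secondary.example.internal/v1/health",
]

def is_healthy(health_url: str, timeout: float = 2.0) -> bool:
    """Treat an HTTP 200 response on the health URL as 'healthy'."""
    try:
        with urllib.request.urlopen(health_url, timeout=timeout) as resp:
            return resp.status == 200
    except OSError:
        return False

def pick_endpoint(endpoints: list[str]) -> str | None:
    """Return the first healthy endpoint, or None if all are unreachable."""
    for url in endpoints:
        if is_healthy(url):
            return url
    return None

if __name__ == "__main__":
    target = pick_endpoint(AI_ENDPOINTS)
    print(f"Routing inference traffic to: {target or 'no healthy endpoint'}")
```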
Below is an example of an LLM retrieval augmented generation (RAG) deployment leveraging an AI factory from a GPUaaS provider. Because data is transported across the secure mesh with encryption, data in transit is protected, and no data is stored at rest at the GPUaaS provider. An organization's corpus data remains at rest in its original location without any change. This architecture also allows AI inference to take place at the edge (a public cloud or even branch offices) for latency-sensitive applications.
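To make that separation concrete, here is a minimal sketch of a RAG request in which retrieval runs against an on-premises corpus and only the assembled prompt is sent, encrypted in transit over HTTPS, to a remote inference endpoint at the GPUaaS provider. The endpoint URL and payload shape are assumptions for illustration.

```python
# Minimal RAG sketch: the document corpus stays on-premises, and only the
# assembled prompt travels (encrypted in transit) to a remote inference
# endpoint hosted by a GPUaaS provider. URL and payload shape are hypothetical.
import json
import urllib.request

# Local, on-premises corpus: nothing here is copied to the GPUaaS provider.
LOCAL_CORPUS = {
    "doc-1": "Our refund policy allows returns within 30 days of purchase.",
    "doc-2": "Support hours are 9am to 6pm, Monday through Friday.",
}

REMOTE_INFERENCE_URL = "https://ai-factory.gpuaas.example/v1/generate"  # hypothetical

def retrieve(question: str, top_k: int = 1) -> list[str]:
    """Naive keyword retrieval over the local corpus (stands in for a vector store)."""
    scored = sorted(
        LOCAL_CORPUS.values(),
        key=lambda doc: sum(w.lower() in doc.lower() for w in question.split()),
        reverse=True,
    )
    return scored[:top_k]

def generate(question: str) -> str:
    """Send only the prompt (question plus retrieved context) over HTTPS for inference."""
    context = "\n".join(retrieve(question))
    payload = json.dumps(
        {"prompt": f"Context:\n{context}\n\nQuestion: {question}"}
    ).encode()
    request = urllib.request.Request(
        REMOTE_INFERENCE_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=10) as resp:
        return json.loads(resp.read())["text"]

if __name__ == "__main__":
    print(generate("What is the refund policy?"))
```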
If the AI application (such as an agentic RAG-enabled chatbot) is made accessible from the Internet, it is important to consider a cloud-network-based web application and API protection (WAAP) service to shield it from cyberattacks.
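A full WAAP service does far more than this, but the sketch below illustrates the kind of request screening, such as payload limits, basic input validation, and per-client rate limiting, that sits in front of an internet-facing AI endpoint. The rules and thresholds are illustrative assumptions, not a description of any specific product's behavior.

```python
# Illustrative pre-checks of the kind a WAAP service performs in front of an
# internet-facing AI chatbot API: payload size limits, simple input validation,
# and per-client rate limiting. Thresholds and rules are assumptions.
import time
from collections import defaultdict, deque

MAX_BODY_BYTES = 8_192           # assumed cap on request size
MAX_REQUESTS_PER_MINUTE = 30     # assumed per-client rate limit
_request_log: dict[str, deque] = defaultdict(deque)

def allow_request(client_ip: str, body: bytes) -> tuple[bool, str]:
    """Return (allowed, reason). Runs before the request reaches the AI application."""
    if len(body) > MAX_BODY_BYTES:
        return False, "payload too large"
    if b"prompt" not in body:
        return False, "missing required 'prompt' field"

    now = time.monotonic()
    window = _request_log[client_ip]
    while window and now - window[0] > 60:
        window.popleft()                     # drop entries older than one minute
    if len(window) >= MAX_REQUESTS_PER_MINUTE:
        return False, "rate limit exceeded"
    window.append(now)
    return True, "ok"

if __name__ == "__main__":
    print(allow_request("203.0.113.7", b'{"prompt": "hello"}'))
```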

By leveraging a platform with a single management console, it is now possible to achieve compliance, observability, and control of all traffic, including API traffic traversing North/South and East/West across the encrypted mesh fabric.
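One way to picture that observability is to label every API flow by direction and emit structured records that a central console can aggregate. The sketch below is a simplified, stand-alone illustration; the field names and network ranges are assumptions, not the platform's actual telemetry format.

```python
# Simplified illustration of classifying API flows as North/South (entering or
# leaving the environment) versus East/West (between internal services) and
# emitting structured records for central aggregation. Field names are assumptions.
import ipaddress
import json
from datetime import datetime, timezone

INTERNAL_NETWORKS = [
    ipaddress.ip_network("10.0.0.0/8"),
    ipaddress.ip_network("192.168.0.0/16"),
]

def is_internal(ip: str) -> bool:
    addr = ipaddress.ip_address(ip)
    return any(addr in net for net in INTERNAL_NETWORKS)

def record_flow(src_ip: str, dst_ip: str, api_path: str) -> str:
    """Label the flow direction and return a JSON record for the console."""
    direction = "east-west" if is_internal(src_ip) and is_internal(dst_ip) else "north-south"
    return json.dumps({
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "src": src_ip,
        "dst": dst_ip,
        "api_path": api_path,
        "direction": direction,
    })

if __name__ == "__main__":
    print(record_flow("10.1.2.3", "10.4.5.6", "/v1/embeddings"))   # east-west
    print(record_flow("198.51.100.9", "10.4.5.6", "/v1/chat"))     # north-south
```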

Moreover, traffic over the public Internet and the private encrypted mesh needs to be observable and controllable by NetOps and SecOps teams to manage what is essentially a complex and heterogeneous multicloud infrastructure.

Keen to learn more?
As AI adoption accelerates, building resiliency into AI systems is critical to ensure long-term success. GPUaaS offers a scalable and efficient solution, but organizations must navigate challenges such as data security, performance variability, and regulatory compliance. By addressing these concerns and leveraging the flexibility of GPUaaS, businesses can better position themselves to meet the growing demands of AI-driven workloads.
If you want to explore how AI resiliency can empower your organization, visit us at the upcoming GovWare conference at Booth P06, October 15 – 17, at Sands Expo and Convention Centre where we’ll be discussing these trends and solutions in more detail.