Quantcast
Browsing all 196 articles
Browse latest View live

Image may be NSFW.
Clik here to view.

Demystifying Kubernetes Cloud Cost Management: Strategies for Visibility,...

Kubernetes cloud cost management is the process of tracking, attributing, and reducing the expenses associated with running clusters in the cloud. Although Kubernetes can improve DevOps operational...

View Article


Image may be NSFW.
Clik here to view.

Mastering Kubernetes Namespaces: Advanced Isolation, Resource Management, and...

Kubernetes namespaces let you separate logical groups of resources within a single Kubernetes cluster. They’re used to share clusters between different apps and provide platform teams with many...

View Article


Image may be NSFW.
Clik here to view.

Rafay Unveils Groundbreaking Platform-as-a-Service (PaaS) Innovations for AI...

In the bustling world of technology, innovation is the lifeblood of progress. At Team Rafay, we continue to innovate and challenge ourselves to go farther than we thought possible. Today, I am thrilled...

View Article

Image may be NSFW.
Clik here to view.

Optimizing Amazon EKS: Advanced Configuration, Scaling, and Cost Management...

Amazon’s Elastic Kubernetes Service (EKS) makes it easy to provision and operate cloud-hosted Kubernetes clusters using AWS. It’s a managed service that automates the process of creating a control...

View Article

Image may be NSFW.
Clik here to view.

CPU vs GPU – A Lethal Combination for AI/ML

This is a multi-series blog on GPUs, how they intersect with Kubernetes and containers. In this blog, we will discuss how CPUs and GPUs are architecturally similar and different. We will also review...

View Article


Image may be NSFW.
Clik here to view.

Navigating Container Management Challenges: Strategies for Security,...

Containers have transformed how software is built and deployed but they pose unique management challenges that can be daunting for DevOps teams to address. You need an effective strategy to mitigate...

View Article

Image may be NSFW.
Clik here to view.

Rafay honored as Gold Stevie® Award Winner in 2024 American Business Awards®

Gold is our new favorite color. This year, more than 300 professionals worldwide participated in the judging process to select this year’s American Business, a.k.a. Stevie Award winners. The awards...

View Article

Image may be NSFW.
Clik here to view.

Navigating MLOps for Platform Teams: Key Challenges and Emerging Best Practices

MLOps is a new discipline that defines processes and best practices for effectively managing machine learning (ML) development and deployment workflows. With demand for ML and generative AI apps...

View Article


Image may be NSFW.
Clik here to view.

LLMOps for Platform Teams: How LLMOps Powers the GenAI Revolution

Generative AI has risen to prominence as the next technology revolution. It’s driven by the surging adoption of Large Language Models (LLMs) such as GPT and Llama, machine learning models that are...

View Article


Image may be NSFW.
Clik here to view.

Mastering Kubernetes Management: Challenges and Best Practices

Kubernetes empowers you to reliably operate and scale cloud-native apps, but it can be daunting to manage your Kubernetes clusters and their associated infrastructure resources. The need to maintain...

View Article

Image may be NSFW.
Clik here to view.

Kubernetes Management with Amazon EKS

Kubernetes management is the process of administering your Kubernetes clusters, their node fleets, and their workloads. Organizations seeking to use Kubernetes at scale must understand effective...

View Article

Image may be NSFW.
Clik here to view.

EC2 vs. Fargate for Amazon EKS: A Cost Comparison

When it comes to running workloads on Amazon Web Services (AWS), two popular choices are Amazon Elastic Compute Cloud (EC2) and AWS Fargate. Both have their merits, but understanding their cost...

View Article

Image may be NSFW.
Clik here to view.

User Access Reports for Kubernetes

Access reviews are required and mandated by regulations such as SOX, HIPAA, GLBA, PCI, NYDFS, and SOC-2. Access reviews are critical to help organizations maintain a strong risk management posture and...

View Article


Image may be NSFW.
Clik here to view.

User Access Reports for Kubernetes

Access reviews are required and mandated by regulations such as SOX, HIPAA, GLBA, PCI, NYDFS, and SOC-2. Access reviews are critical to help organizations maintain a strong risk management posture and...

View Article

Image may be NSFW.
Clik here to view.

PyTorch vs. TensorFlow: A Comprehensive Comparison

When it comes to deep learning frameworks, PyTorch and TensorFlow are two of the most prominent tools in the field. Both have been widely adopted by researchers and developers alike, and while they...

View Article


Image may be NSFW.
Clik here to view.

What GPU Metrics to Monitor and Why?

With the increasing reliance on GPUs for compute-intensive tasks such as machine learning, deep learning, data processing, and rendering, both infrastructure administrators and users of GPUs (i.e. data...

View Article

Image may be NSFW.
Clik here to view.

Building an Extensible GenAI Copilot: What We Learned

Working through the complexities of developing an internal copilot helped us push the boundaries of what we believed possible with GenAI. Our generative AI (GenAI) journey began with a single use case:...

View Article


Image may be NSFW.
Clik here to view.

GPU Metrics – Power

In the previous blog, we discussed why tracking and reporting GPU SM Clock metrics matters. In this blog, we will dive deeper into another critical GPU metric i.e. GPU Power. Important Navigate to...

View Article

Image may be NSFW.
Clik here to view.

GPU Metrics – Framebuffer

In the previous blog, we discussed why tracking and reporting GPU power usage matters. In this blog, we will dive deeper into another critical GPU metric i.e. GPU Framebuffer usage. Important Navigate...

View Article

Image may be NSFW.
Clik here to view.

GPU Metrics – SM Clock

In the previous blog, we discussed why tracking and reporting GPU Memory Utilization metrics matters. In this blog, we will dive deeper into another critical GPU metric i.e. GPU SM Clock. The GPU SM...

View Article
Browsing all 196 articles
Browse latest View live