Deploying a Microservices App: A Real-World Walkthrough
In this practical section, we’ll deploy a production-grade microservices application using Kubernetes. You’ll build a complete stack with frontend, backend, and database components, then implement intelligent scaling patterns. By the end, you’ll have a system that handles real-world traffic while maintaining resilience.
Frontend + Backend + DB: Building Your Microservices Stack
Let’s deploy a minimal but functional microservice ecosystem. We’ll create:
- A React frontend
- A Node.js/Express backend
- A PostgreSQL database
This setup follows the “12-Factor” principles and uses Kubernetes-native patterns for production readiness.
Database Setup: PostgreSQL in Kubernetes
We start with the database since it’s the foundation. Here’s a production-ready PostgreSQL deployment with persistence:
```yaml
# postgres.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: postgres
spec:
  replicas: 1
  selector:
    matchLabels:
      app: postgres
  template:
    metadata:
      labels:
        app: postgres
    spec:
      containers:
        - name: postgres
          image: postgres:14.5
          ports:
            - containerPort: 5432
          env:
            - name: POSTGRES_PASSWORD
              value: "secure_password_123"
            - name: POSTGRES_USER
              value: "app_user"
            - name: POSTGRES_DB
              value: "microservice_db"
          volumeMounts:
            - name: postgres-persistent-storage
              mountPath: /var/lib/postgresql/data
      volumes:
        - name: postgres-persistent-storage
          persistentVolumeClaim:
            claimName: postgres-pvc
---
apiVersion: v1
kind: Service
metadata:
  name: postgres-service
spec:
  type: ClusterIP
  ports:
    - port: 5432
      targetPort: 5432
  selector:
    app: postgres
```
Why this works: The persistent volume claim (PVC) keeps the data directory on durable storage, so data survives pod restarts and rescheduling. We use a dedicated database user (`app_user`) rather than the default superuser. Note that the password is hardcoded here for brevity; in production it belongs in a Kubernetes Secret. This deployment is runnable with `kubectl apply -f postgres.yaml` after creating the PVC.
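The PVC referenced by the deployment isn't shown above; a minimal sketch looks like the following (the 5Gi size is an illustrative value — adjust it, and set `storageClassName` if your cluster doesn't have a usable default):

```yaml
# postgres-pvc.yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: postgres-pvc
spec:
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 5Gi
```

Apply it with `kubectl apply -f postgres-pvc.yaml` before the deployment, so the pod has a volume to bind to.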
💡 Pro Tip: In production, add a `livenessProbe` and `readinessProbe` to the PostgreSQL container to detect health issues.
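A sketch of what those probes might look like on the postgres container, using the `pg_isready` utility that ships in the official image (timing values are illustrative):

```yaml
# Added under the postgres container spec
livenessProbe:
  exec:
    command: ["pg_isready", "-U", "app_user", "-d", "microservice_db"]
  initialDelaySeconds: 30
  periodSeconds: 10
readinessProbe:
  exec:
    command: ["pg_isready", "-U", "app_user", "-d", "microservice_db"]
  initialDelaySeconds: 5
  periodSeconds: 5
```

`pg_isready` exits non-zero while the server is still starting up, so the readiness probe keeps traffic away until the database accepts connections.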
Backend Service: Node.js REST API
Next, we deploy the backend that connects to the database. Here’s a minimal Express API with proper connection pooling:
```dockerfile
# backend/Dockerfile
FROM node:20
WORKDIR /app
COPY package.json .
RUN npm install
COPY . .
EXPOSE 3000
CMD ["npm", "start"]
```
backend/package.json:

```json
{
  "name": "backend",
  "version": "1.0.0",
  "scripts": {
    "start": "node server.js"
  },
  "dependencies": {
    "express": "^4.18.2",
    "pg": "^8.10.0"
  }
}
```

The `start` script is required for the Dockerfile's `npm start` to work; `server.js` is an assumed entry-point name.
```yaml
# backend.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: backend
spec:
  replicas: 1
  selector:
    matchLabels:
      app: backend
  template:
    metadata:
      labels:
        app: backend
    spec:
      containers:
        - name: backend
          image: your-registry/backend:1.0
          ports:
            - containerPort: 3000
          env:
            - name: DATABASE_URL
              value: "postgres://app_user:secure_password_123@postgres-service:5432/microservice_db"
          readinessProbe:
            httpGet:
              path: /health
              port: 3000
            initialDelaySeconds: 5
            periodSeconds: 10
```
Key features:
- Environment variables for database connection configuration
- A readiness probe so traffic only reaches healthy pods
- Minimal dependencies for fast startup

Build and push the Docker image, then run `kubectl apply -f backend.yaml`.
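Putting the connection string in plain text in the manifest works, but it exposes the password to anyone who can read the YAML. A common improvement is to move it into a Secret and reference it from the deployment — a sketch (the Secret name `db-credentials` is an assumption):

```yaml
# db-secret.yaml
apiVersion: v1
kind: Secret
metadata:
  name: db-credentials
type: Opaque
stringData:
  DATABASE_URL: "postgres://app_user:secure_password_123@postgres-service:5432/microservice_db"
---
# Then, in backend.yaml, the env entry becomes:
#   env:
#     - name: DATABASE_URL
#       valueFrom:
#         secretKeyRef:
#           name: db-credentials
#           key: DATABASE_URL
```

The application code doesn't change at all — it still reads `process.env.DATABASE_URL`; only where the value comes from changes.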
Frontend Service: React Application
Finally, we deploy the frontend that consumes the backend API. Here’s a minimal React app:
```bash
# Create frontend
npx create-react-app frontend
cd frontend
npm install axios
```
```javascript
// frontend/src/App.js
import React, { useState, useEffect } from 'react';
import axios from 'axios';

function App() {
  const [data, setData] = useState([]);

  useEffect(() => {
    const fetchData = async () => {
      try {
        // REACT_APP_API_URL is baked in at build time by create-react-app
        const response = await axios.get(process.env.REACT_APP_API_URL);
        setData(response.data);
      } catch (error) {
        console.error('Error fetching data:', error);
      }
    };
    fetchData();
  }, []);

  return (
    <div className="App">
      <h1>Microservice Dashboard</h1>
      <ul>
        {data.map(item => (
          <li key={item.id}>{item.name}</li>
        ))}
      </ul>
    </div>
  );
}

export default App;
```
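The deployment below assumes a frontend image that serves the built app on port 80. The Dockerfile isn't part of the original; a typical multi-stage sketch compiles the React bundle and hands it to nginx:

```dockerfile
# frontend/Dockerfile -- assumed multi-stage build
FROM node:20 AS build
WORKDIR /app
COPY package.json .
RUN npm install
COPY . .
RUN npm run build

# Serve the static bundle with nginx on port 80
FROM nginx:1.25
COPY --from=build /app/build /usr/share/nginx/html
EXPOSE 80
```

The multi-stage split keeps the Node toolchain out of the final image, so the pod that ships is just nginx plus static files.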
Deployment configuration (frontend.yaml):
```yaml
# frontend.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: frontend
spec:
  replicas: 1
  selector:
    matchLabels:
      app: frontend
  template:
    metadata:
      labels:
        app: frontend
    spec:
      containers:
        - name: frontend
          image: your-registry/frontend:1.0
          ports:
            - containerPort: 80
          env:
            - name: REACT_APP_API_URL
              value: "http://backend:3000/api/users"
          readinessProbe:
            httpGet:
              # probe the root page; a static file server has no /health route
              path: /
              port: 80
            initialDelaySeconds: 3
            periodSeconds: 5
```
Why this works: Inside the cluster, the frontend reaches the backend through its Service name (`backend`), which Kubernetes DNS resolves automatically — no hardcoded IPs. Two caveats: cluster DNS names only resolve inside the cluster, so requests coming from a user's browser must go through an Ingress or reverse proxy; and create-react-app bakes `REACT_APP_*` variables in at build time, so the deployment's env var only takes effect if the image is built with it set.
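One gap worth flagging: the manifests above define Deployments but no Services for `backend` or `frontend`, so the name `backend` won't resolve yet. A sketch of the missing Services plus an Ingress to admit browser traffic (assumes an ingress controller such as ingress-nginx is installed; names mirror the manifests above):

```yaml
# services-and-ingress.yaml -- sketch, not part of the original manifests
apiVersion: v1
kind: Service
metadata:
  name: backend
spec:
  selector:
    app: backend
  ports:
    - port: 3000
      targetPort: 3000
---
apiVersion: v1
kind: Service
metadata:
  name: frontend
spec:
  selector:
    app: frontend
  ports:
    - port: 80
      targetPort: 80
---
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: app-ingress
spec:
  rules:
    - http:
        paths:
          - path: /api
            pathType: Prefix
            backend:
              service:
                name: backend
                port:
                  number: 3000
          - path: /
            pathType: Prefix
            backend:
              service:
                name: frontend
                port:
                  number: 80
```

With this in place, browsers hit the ingress controller's external address, `/api` requests are routed to the backend pods, and everything else serves the React app.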
✅ Full stack readiness: With these components, you can run the entire stack with:
```bash
kubectl apply -f postgres.yaml
kubectl apply -f backend.yaml
kubectl apply -f frontend.yaml
```

Scaling Services: From Single to Production
Now that your stack is live, we’ll implement scaling patterns that handle traffic spikes while maintaining stability.
Horizontal Pod Autoscaling (HPA)
HPA automatically scales your backend pods based on CPU usage. This is critical for handling traffic surges.
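One way to define this is declaratively; a sketch using the `autoscaling/v2` API (requires metrics-server to be running in the cluster):

```yaml
# backend-hpa.yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: backend-hpa
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: backend
  minReplicas: 1
  maxReplicas: 10
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 50
```

Apply it with `kubectl apply -f backend-hpa.yaml`, or use the imperative command below for the same effect.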
```bash
# Enable HPA for the backend
kubectl autoscale deployment backend \
  --name=backend-hpa \
  --min=1 \
  --max=10 \
  --cpu-percent=50
```

How it works:
- When average CPU utilization rises above the 50% target, Kubernetes adds replicas, roughly in proportion to how far usage exceeds the target, up to the max
- When utilization falls back below the target, replicas are removed gradually, after a scale-down stabilization window (five minutes by default)
- Works with the backend's `readinessProbe`, so newly scaled pods receive traffic only once they report ready

Service-Level Scaling
For frontend scaling, we use a more advanced pattern that scales based on external traffic:
```yaml
# frontend-scaling.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: frontend
spec:
  replicas: 1
  # ... (same as before)
```

Why this matters: rather than reacting to pod CPU alone, frontend capacity is usually driven by signals closer to the user:
- Real user traffic (via ingress controller)
- Geographic distribution (via load balancers)
- Session persistence requirements
Production Scaling Workflow
Here’s the complete scaling sequence for a traffic spike:
1. Initial traffic → 1 backend pod + 1 frontend pod
2. CPU usage hits 50% → HPA adds a backend pod
3. Traffic increases → ingress controller routes 20% more users
4. Frontend latency hits 400 ms → frontend autoscaling adds a replica
5. Traffic stabilizes → each deployment scales back down to one replica
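You can watch this sequence play out live. Assuming the `backend-hpa` autoscaler created earlier and a working metrics-server, these commands stream the autoscaler's decisions as they happen:

```bash
# Current vs. target utilization and replica counts, refreshed on change
kubectl get hpa backend-hpa --watch

# In another terminal: watch backend pods appear and terminate
kubectl get pods -l app=backend --watch
```

During a spike you'll see the HPA's `TARGETS` column climb past 50%, the `REPLICAS` count step up, and then, a few minutes after traffic subsides, step back down.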
🌟 Critical insight: avoid scaling the frontend ahead of the backend. Extra frontend replicas push more requests at the backend, and if the backend has no headroom that load can cascade into failures. Make sure backend capacity scales first, then the frontend.
Summary
You’ve now built and scaled a production-grade microservices stack:
- Database with persistent storage (PostgreSQL)
- Backend with health checks and secure connections
- Frontend with service discovery
- Scaling via HPA and traffic-aware patterns
This implementation follows industry best practices for:
- Secure configuration (environment variables)
- Resilience (readiness probes)
- Cost efficiency (auto-scaling)
- Service isolation (the database is reachable only through an internal ClusterIP service)
Your next step: Add a CI/CD pipeline to automatically deploy updates. The patterns here scale to thousands of users while keeping the system available — the foundation of modern cloud applications.
💡 Remember: In production, always test scaling with real traffic before enabling auto-scaling. Start with 10% of your expected load to avoid over-provisioning.
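For that kind of controlled test, a simple in-cluster load generator works well. A sketch following the pattern used in the Kubernetes HPA walkthrough (the URL assumes a Service named `backend` in front of the backend pods):

```bash
# Run a throwaway pod that continuously hits the backend
kubectl run load-generator --image=busybox:1.36 --restart=Never -- \
  /bin/sh -c "while true; do wget -q -O- http://backend:3000/api/users; done"

# Clean up when the test is done
kubectl delete pod load-generator
```

Start one generator pod for roughly 10% load, watch how the HPA responds, and add more generators incrementally before trusting the autoscaler with real traffic.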