Understanding Concurrency: A Brief Guide to Common Patterns and Models

Bruno Krebs

2022-10-19

Introduction

Concurrency is a crucial aspect of modern computing that enables software to perform multiple tasks at once, enhancing both scalability and efficiency. This article delves into various concurrency models and patterns that developers commonly employ to handle simultaneous operations effectively in software development. These models are integral for creating responsive, high-performing applications across different programming environments.

Concurrency vs. Parallelism

Understanding the distinction between concurrency and parallelism is crucial when exploring concurrency models. To clarify:

Concurrency refers to the capability of a system to manage multiple tasks by allowing them to make progress simultaneously or in an interleaved fashion. It focuses on the structure and management of tasks rather than their execution on hardware.
Parallelism involves the actual simultaneous execution of multiple tasks, requiring multiple processing units, such as CPU cores. This form of concurrency leverages hardware to perform several operations at the same time.

In essence, while concurrency concerns itself with the handling and ordering of multiple tasks, parallelism refers specifically to the simultaneous execution of these tasks, exploiting multi-core architectures for efficiency.

Thread-Based Concurrency

Threads are lightweight processes that run concurrently within a single process, sharing its memory space and resources. This architecture allows efficient inter-thread communication and data sharing, making thread-based concurrency ideal for enhancing performance in both I/O-bound and compute-bound applications. Widely used in Java, C++, and Python, each language offers distinct mechanisms for thread management, from Java’s comprehensive java.util.concurrent package to Python’s Global Interpreter Lock (GIL) optimized for I/O tasks.

Despite their efficiency, threads introduce complexities like race conditions and deadlocks due to shared state access, necessitating synchronization mechanisms like mutexes and locks for thread safety. This approach is key for creating snappy interfaces and making the most of multi-core processors, striking a balance between boosting performance and navigating the complexities of concurrent programming.

Here is a simple example of thread-based concurrency in Java:

public class SimpleThreadExample {

    public static void main(String[] args) {
        // Create two threads using lambda expressions
        Thread thread1 = new Thread(() -> printNumbers("Thread 1"));
        Thread thread2 = new Thread(() -> printNumbers("Thread 2"));

        // Start the threads
        thread1.start();
        thread2.start();
    }

    public static void printNumbers(String threadName) {
        for (int i = 1; i <= 5; i++) {
            System.out.println(threadName + ": " + i);
            try {
                // Sleep for a bit to simulate some work being done
                Thread.sleep(100);
            } catch (InterruptedException e) {
                e.printStackTrace();
            }
        }
    }
}

In this example:

We define a SimpleThreadExample class with a main method that serves as the entry point of the program.
Inside main, we create two Thread objects, thread1 and thread2. Each thread is given a task to execute, defined by a lambda expression that calls the printNumbers method.
The printNumbers method prints numbers from 1 to 5, pausing for 100 milliseconds between each print to simulate doing some work.
We start the threads by calling the start() method on each Thread object. This causes the run() method of these threads to execute concurrently.

When run, this program will output numbers from 1 to 5 from both threads, demonstrating basic thread-based concurrency in Java. For example, if we save this code in a file named SimpleThreadExample.java, we can compile and run it using the following commands:

1	javac SimpleThreadExample.java && java SimpleThreadExample

Event-Driven Concurrency

Event-driven concurrency operates on tasks triggered by events or messages, avoiding a strict execution sequence. It’s particularly effective in frameworks like Node.js, which excels in handling multiple web requests simultaneously, and in UI frameworks such as React and Angular, where user actions dynamically update the interface. This model allows for efficient resource use, as tasks are executed only in response to specific events, leading to responsive applications that efficiently manage workload and user interactions.

Here is a simple example of event-driven concurrency in Node.js:

const http = require('http');

// Create an HTTP server that responds to every request
const server = http.createServer((req, res) => {
  if (req.url === '/') {
    res.end('Welcome to our home page');
  } else if (req.url === '/about') {
    // Simulate a delay to mimic database fetching or any asynchronous operation
    setTimeout(() => {
      res.end('Here is our short history');
    }, 2000);
  } else {
    res.end('Page not found');
  }
});

server.listen(3000, () => {
  console.log('Server is running on http://localhost:3000');
});

This Node.js server listens for HTTP requests and responds differently based on the URL path. It showcases event-driven concurrency by using a non-blocking setTimeout to simulate an asynchronous operation, such as fetching data from a database. While waiting for the timeout to complete, the server can still process other requests, demonstrating the efficiency of event-driven models in handling I/O-bound tasks.

Actor Model

The Actor Model is a concurrency paradigm that treats “actors” as the fundamental units of computation. In this model, actors are objects that encapsulate state and behavior, interacting through message passing rather than shared state. This design inherently avoids many of the concurrency issues related to shared-memory models, such as race conditions and deadlocks, by ensuring that each actor operates independently, processing messages sequentially. Actors can create more actors, send messages to other actors, and determine how to respond to messages they receive.

Elixir, a dynamic, functional language designed for building scalable and maintainable applications, leverages the Actor Model through its processes. The language provides the GenServer behavior module, a generic server implementation that simplifies the process of building concurrent, fault-tolerant applications. Here’s an example of a simple key-value store implemented in Elixir using GenServer:

defmodule KeyValueStore do
  use GenServer

  # Client API
  def start(default \\ %{}) do
    GenServer.start(__MODULE__, default, name: __MODULE__)
  end

  def put(key, value) do
    GenServer.call(__MODULE__, {:put, key, value})
  end

  def get(key) do
    GenServer.call(__MODULE__, {:get, key})
  end

  # Server Callbacks
  def init(initial_state) do
    {:ok, initial_state}
  end

  def handle_call({:put, key, value}, _from, state) do
    # Simulate delay
    :timer.sleep(5000)
    {:reply, :ok, Map.put(state, key, value)}
  end

  def handle_call({:get, key}, _from, state) do
    {:reply, Map.get(state, key), state}
  end
end

In the KeyValueStore module, leveraging Elixir’s GenServer behavior, the system manages a key-value store where each action is processed as an isolated event. The start function initializes the server with a unique name and an optional initial state, enabling direct interactions. For adding or updating entries, the put function employs GenServer.cast to send asynchronous messages, executing without awaiting a response. This method incorporates a 3-second delay within handle_cast to simulate long operations, updating the state efficiently and ensuring the server remains responsive to other requests. Meanwhile, get functions synchronously, using GenServer.call for immediate data retrieval, handled by handle_call. This separation of asynchronous and synchronous methods exemplifies a robust approach to managing state and interactions in high-concurrency environments, demonstrating the Actor Model’s effectiveness in maintaining isolated state and non-blocking operations.

Coroutine-Based Concurrency

Coroutine-based concurrency allows for writing asynchronous code in a way that appears synchronous, which simplifies complex logic that depends on asynchronous operations. In JavaScript, this model is implemented using async/await, built on top of the language’s Promises. This approach enables developers to handle asynchronous operations like API calls, file I/O, or any tasks that require waiting for operations to complete, without blocking the main thread.

Here’s a simple JavaScript example that demonstrates using async/await to perform asynchronous operations:

async function fetchData(url) {
    try {
        console.log("Fetching data...");
        const response = await fetch(url);
        const data = await response.json();
        console.log("Data received:", data);
    } catch (error) {
        console.error("Error fetching data:", error);
    }
}

fetchData('https://api.example.com/data');

In this code, fetchData is an asynchronous function marked by the async keyword. Inside the function, await is used before fetch, which is an asynchronous API call that returns a Promise. The await keyword causes the function execution to pause until the Promise resolves, at which point it resumes with the resolved value. This pause does not block the main JavaScript thread, allowing other operations to continue running. The function handles exceptions with a try-catch block, making error management straightforward. This example encapsulates how coroutine-based concurrency can be elegantly handled in JavaScript, providing a clear structure for asynchronous code that is easy to read and maintain.

Go’s Concurrency Model (Goroutines and Channels)

Go’s concurrency model is built around goroutines and channels, offering a distinctive approach that simplifies concurrent and parallel programming. Goroutines are functions or methods that run concurrently with other goroutines in the same address space. They are lightweight, costing little more than the allocation of stack space, and the Go runtime efficiently manages their execution across available CPU cores. Channels in Go provide a way for goroutines to communicate with each other, allowing them to synchronize execution and share data without explicit locks or condition variables.

Here’s a straightforward example in Go that demonstrates using goroutines and channels to handle concurrency:

package main

import (
	"fmt"
	"time"
)

func worker(id int, jobs <-chan int, results chan<- int) {
	for j := range jobs {
		fmt.Println("worker", id, "processing job", j)
		time.Sleep(time.Second)
		results <- j * 2
	}
}

func main() {
	jobs := make(chan int, 100)
	results := make(chan int, 100)

	for w := 1; w <= 3; w++ {
		go worker(w, jobs, results)
	}

	for j := 1; j <= 5; j++ {
		jobs <- j
	}
	close(jobs)

	for a := 1; a <= 5; a++ {
		<-results
	}
}

In this example, three worker goroutines are created to process jobs. Each worker receives jobs from the jobs channel, processes them by simulating a task (a one-second delay), and sends the result to the results channel. The main function feeds jobs into the jobs channel and retrieves results from the results channel. The use of channels ensures that data flows safely between goroutines without the need for explicit synchronization mechanisms like locks, demonstrating Go’s powerful and straightforward approach to building concurrent applications.