CUDA C/C++ on Google Colaboratory


What is CUDA?
CUDA (Compute Unified Device Architecture) is a parallel computing platform and programming model developed by NVIDIA for general-purpose computing on its own GPUs. CUDA lets developers speed up compute-intensive applications by offloading the parallelizable parts of the computation to the GPU.
CUDA code does not run on AMD or Intel graphics hardware; it requires an NVIDIA GPU installed in your machine.
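To make the model concrete, here is a minimal sketch of what CUDA C code looks like (the kernel name hello and the thread count are arbitrary choices for this illustration): a function marked __global__ runs on the GPU, and the <<<blocks, threads>>> launch syntax controls how many parallel threads execute it. The rest of this article sets up an environment where code like this can be compiled and run.
#include <stdio.h>

// A kernel: this function runs on the GPU, once per thread
__global__ void hello() {
    printf("Hello from GPU thread %d\n", threadIdx.x);
}

int main() {
    // Launch the kernel on 1 block of 8 threads
    hello<<<1, 8>>>();
    // Wait for the GPU to finish before the program exits
    cudaDeviceSynchronize();
    return 0;
}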
What is Google Colab?
Google Colab is a free cloud service, and the feature that distinguishes it from other free cloud services is that it offers a GPU at no cost. With Colab you can work on the GPU with CUDA C/C++ for free!
Does the NVIDIA requirement mean that not everyone can use CUDA? Not really: we can use a cloud resource such as Google Colaboratory instead. So, let's start with that!
Procedure:
- Go to https://colab.research.google.com in your browser and click on New Notebook

- Switch the runtime from CPU to GPU:
Click on Runtime > Change runtime type > Hardware accelerator > GPU > Save
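Before going further, you can confirm that an NVIDIA GPU is actually attached to the runtime by querying the driver; the exact GPU model you get varies between Colab sessions.
!nvidia-smi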

- Completely uninstall any previous CUDA versions. We need to refresh the CUDA installation on the cloud instance:
!apt-get --purge remove cuda nvidia* libnvidia-*
!dpkg -l | grep cuda- | awk '{print $2}' | xargs -n1 dpkg --purge
!apt-get remove cuda-*
!apt autoremove
!apt-get update
Write each group of commands in a separate code block and run that block. Every line that starts with '!' is executed as a command-line (shell) command.
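If you want to confirm the old packages are really gone before reinstalling, you can re-run the package listing used above on its own; once the purge has completed, this cell should print nothing.
!dpkg -l | grep cuda-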
- Install CUDA version 9.2:
!wget https://developer.nvidia.com/compute/cuda/9.2/Prod/local_installers/cuda-repo-ubuntu1604-9-2-local_9.2.88-1_amd64 -O cuda-repo-ubuntu1604-9-2-local_9.2.88-1_amd64.deb
!dpkg -i cuda-repo-ubuntu1604-9-2-local_9.2.88-1_amd64.deb
!apt-key add /var/cuda-repo-9-2-local/7fa2af80.pub
!apt-get update
!apt-get install cuda-9.2
- Check your CUDA installation by running the command given below:
!nvcc --version

- Install a small extension to run nvcc from the Notebook cells:
!pip install git+https://github.com/andreinechaev/nvcc4jupyter.git

- Load the extension using the code given below:
%load_ext nvcc_plugin

Now we are ready to run CUDA C/C++ code right in the notebook.
- Check whether CUDA is working:
To run CUDA C/C++ code in the notebook, add the %%cu cell magic at the beginning of the code cell.
%%cu
#include <stdio.h>
#include <stdlib.h>

__global__ void add(int *a, int *b, int *c) {
    *c = *a + *b;
}

int main() {
    int a, b, c;          // host copies of variables a, b & c
    int *d_a, *d_b, *d_c; // device copies of variables a, b & c
    int size = sizeof(int);

    // Allocate space for device copies of a, b, c
    cudaMalloc((void **)&d_a, size);
    cudaMalloc((void **)&d_b, size);
    cudaMalloc((void **)&d_c, size);

    // Setup input values
    c = 0;
    a = 3;
    b = 5;

    // Copy inputs to device
    cudaMemcpy(d_a, &a, size, cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, &b, size, cudaMemcpyHostToDevice);

    // Launch add() kernel on GPU
    add<<<1,1>>>(d_a, d_b, d_c);

    // Copy result back to host
    cudaError err = cudaMemcpy(&c, d_c, size, cudaMemcpyDeviceToHost);
    if (err != cudaSuccess) {
        printf("CUDA error copying to Host: %s\n", cudaGetErrorString(err));
    }
    printf("result is %d\n", c);

    // Cleanup
    cudaFree(d_a);
    cudaFree(d_b);
    cudaFree(d_c);
    return 0;
}
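
The kernel above is launched with <<<1,1>>>, so it runs on a single GPU thread: enough to verify the toolchain, but it does not use any parallelism. Below is a minimal sketch of how the same pattern extends to adding two arrays element-wise with one thread per element; the array length N and the kernel name vecAdd are illustrative choices, not part of the original example.
%%cu
#include <stdio.h>

#define N 512

// Each thread adds one pair of elements
__global__ void vecAdd(int *a, int *b, int *c) {
    int i = threadIdx.x + blockIdx.x * blockDim.x;
    if (i < N) {
        c[i] = a[i] + b[i];
    }
}

int main() {
    int a[N], b[N], c[N];
    int *d_a, *d_b, *d_c;
    int size = N * sizeof(int);

    // Fill the host arrays with test values
    for (int i = 0; i < N; i++) {
        a[i] = i;
        b[i] = 2 * i;
    }

    // Allocate device memory and copy the inputs over
    cudaMalloc((void **)&d_a, size);
    cudaMalloc((void **)&d_b, size);
    cudaMalloc((void **)&d_c, size);
    cudaMemcpy(d_a, a, size, cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, b, size, cudaMemcpyHostToDevice);

    // Launch one block of N threads: thread i computes c[i]
    vecAdd<<<1, N>>>(d_a, d_b, d_c);

    // Copy the result back to the host and spot-check it
    cudaMemcpy(c, d_c, size, cudaMemcpyDeviceToHost);
    printf("c[0] = %d, c[N-1] = %d\n", c[0], c[N - 1]);

    cudaFree(d_a);
    cudaFree(d_b);
    cudaFree(d_c);
    return 0;
}
With 512 elements a single block of threads is enough; for larger arrays you would launch multiple blocks and keep the same index calculation.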

And YOU DID IT!