Beginner’s Journey into Google Cloud Platform’s Fundamentals

Muhammad Fahad
13 min readMay 29, 2023

--

Google Cloud Platform is a cloud computing service offered by Google. It leverages Google’s extensive infrastructure and expertise to deliver a wide range of services that cater to diverse needs. It provides resources such as computing power, storage, databases and developer tools over the internet, allowing individuals and businesses to build, deploy and manage applications and services without the need to invest in and maintain physical infrastructure.

Google Cloud Platform offers a wide array of services that cater to different aspects of cloud computing. Here are some of the major services:

  • Compute Engine: Compute Engine allows you to create and manage virtual machines or VMs in the cloud. It offers customizable virtual machines with different specifications to match your compute needs. You have the flexibility to scale your resources up or down based on demand ensuring optimal performance and cost-efficiency.
  • Storage: Storage services provided by Google Cloud Platform allow you to store, manage, and access your data in the cloud. GCP offers a range of storage options to meet different needs, including object storage, file storage, database storage and block storage.
  • Big Query: It is a managed and serverless data warehouse built for analyzing vast data. It allows you to execute rapid, SQL-like queries on extensive data sets to extract valuable insights. Big Query smoothly integrates with other GCP services, making the process of data ingestion and analysis pipelines simpler and more efficient.
  • PubSub: PubSub is a messaging service that facilitates reliable, asynchronous communication between independent applications. It decouples senders (publishers) and receivers (subscribers), enabling scalable and loosely coupled architectures. Pub/Sub is commonly used for building event-driven systems and real-time data streaming.

Google Compute Engine

Google Compute Engine is like having your own computer in the cloud. Instead of using a physical computer, you can create virtual machines (VMs) on Google’s powerful servers. These virtual machines work just like real computers, but they exist in the cloud instead of on your desk.

You have the freedom to choose the size and power of your virtual machine. It’s like picking a computer with the right amount of memory (RAM), processing power (CPU) and storage space (hard drive) for your needs. You can start with a small machine and easily upgrade to a bigger one as your needs grow. One of the cool things about Compute Engine is that it’s really flexible. You can create and run as many virtual machines as you want, and you can turn them on and off whenever you need them. This way, you only pay for the computing power you actually use, which can save you money.

Compute Engine is part of the Google Cloud Platform, which means it works seamlessly with other Google Cloud services. You can easily integrate your virtual machines with services like Cloud Storage for storing files, Big Query for analyzing data, or Dataflow for processing large amounts of information.

Google Data Flow

Google Dataflow is a service provided by Google Cloud Platform (GCP) that allows you to build and execute data processing pipelines. It helps developers and data engineers process and analyze large amounts of data in a scalable and efficient manner. Imagine you have a big box full of different colored balls, and you want to sort them by color. Doing it manually would take a long time, right? But with Dataflow, you can automate the sorting process and get it done much faster.

Dataflow works by breaking down your data processing tasks into smaller steps called “transforms.” Each transform represents a specific action, like filtering, aggregating or transforming the data. You can connect these transforms together to create a pipeline that defines the entire data processing workflow. Once pipeline is built, Dataflow takes care of the pipeline. It automatically distributes the work across multiple computers, called “workers,” which can process data in parallel. This means that multiple tasks can be performed simultaneously, speeding up the overall processing time.

Dataflow also takes care of managing resources efficiently. It automatically scales up or down based on the size of your data and the processing requirements, so you don’t have to worry about setting up and managing the infrastructure yourself. It’s like having an elastic workforce that adapts to your needs. Dataflow provides built-in tools for monitoring and troubleshooting your data pipelines. You can track the progress of your jobs, monitor resource usage, and analyze the performance of your data processing tasks. This helps you identify any issues or bottlenecks and optimize your pipelines for better efficiency.

Google Cloud Storage

Google Cloud Storage is like having a giant online storage space where you can store and access your files and data. It’s a service provided by Google Cloud Platform that offers reliable, scalable, and secure storage for your information. Think of it as a virtual warehouse where you can keep all your digital belongings. Whether it’s documents, images, videos, or any other type of file, you can store them securely in Google Cloud Storage.

You can start with a small amount of storage and easily expand as your needs grow. It’s like having an infinite storage capacity, so you don’t have to worry about running out of space.

Cloud Storage also provides high durability, meaning that your files are stored safely and protected against data loss. Google ensures that your data is replicated across multiple servers and locations, so even if one server fails, your files remain safe and accessible. When it comes to security, Cloud Storage ensures the confidentiality and integrity of your files. It offers features like access controls, allowing you to specify who can access your files and what they can do with them. It also provides encryption at rest and in transit to protect your data from unauthorized access.

Accessing your files stored in Cloud Storage is simple and convenient. You can easily retrieve your files from anywhere using a web interface or through an Application Programming Interface (API). This means you can access your files from your computer, smartphone or any other device connected to the internet.

Cloud Storage offers advanced features like data lifecycle management, which allows you to automate the movement and deletion of files based on their age or usage. This helps you optimize storage costs and maintain an organized storage environment.

Google Cloud Pub/Sub

Google Cloud Pub/Sub (Publisher/Subscriber) is like having a messaging system that allows different parts of your applications or services to communicate with each other. It’s a service provided by Google Cloud Platform that enables reliable and asynchronous messaging between components in a distributed system. For Example, Spotify use Pub/Sub to notify users about new song releases. They would have a topic called “NewSongReleases” where they publish messages containing details about the latest songs. Users who are interested in receiving these notifications can subscribe to the topic. Whenever a new song is released, Pub/Sub delivers the message to all subscribers, allowing them to stay updated and listen to the new music.

You can have publishers that send messages and subscribers that receive those messages. Publishers can be any application or service that wants to send information, and subscribers can be other applications or services that are interested in receiving and processing that information.

One of the key advantages of Pub/Sub is its reliability. When a message is published, it is stored in a durable and highly available system. This means that even if a subscriber is temporarily unavailable, it won’t miss any messages. The messages will be stored until the subscriber is ready to receive and process them. It also ensures that messages are delivered in the order they were published. So, if multiple messages are sent, they will be received and processed in the same order by the subscribers. This is important for maintaining the integrity and consistency of the data being exchanged.

Pub/Sub have the ability to handle high message. It can handle a large number of messages being published and consumed, making it suitable for building scalable and real-time systems. It’s like having a messaging system that can handle thousands or even millions of messages per second.

Google Cloud IAM

Google Cloud IAM, short for Identity and Access Management is a service provided by Google Cloud Platform that helps you manage and control access to your cloud resources. It allows you to define who has what level of access to your resources and what actions they can perform. IAM acts like a security guard for your cloud environment, ensuring that only authorized individuals or systems can access your resources. It helps you protect your data, control permissions and maintain the overall security of your cloud infrastructure.

Here’s how IAM works in a simple way:

  1. Users and Roles: In IAM, you can create users, which represent individuals or entities who need access to your resources. Each user is assigned a role, which defines the set of permissions they have. For example, you can have an “Admin” role for someone who needs full control and a “Viewer” role for someone who only needs to view resources.
  2. Projects and Resources: Your cloud resources, such as virtual machines, databases or storage are organized into projects. IAM allows you to grant or revoke access at the project level or for specific resources within a project. This gives you fine-grained control over who can access what.
  3. Permissions: IAM provides a set of predefined permissions that determine what actions can be performed on resources. These permissions include actions like read, write, create, delete and more. By assigning appropriate roles and permissions, you can ensure that users have the necessary access to perform their tasks without granting unnecessary privileges.
  4. Access Control: IAM allows you to manage access control based on the principle of least privilege. This means that users are granted only the permissions they need to perform their tasks, minimizing the risk of accidental or malicious actions. You can easily add or remove users, adjust their roles, and update permissions as needed.

By leveraging Google Cloud IAM, you can effectively manage and control access to your cloud resources ensuring that only authorized users can perform specific actions. This helps protect your data, maintain security, and maintain compliance with regulatory requirements.

Google Cloud BigQuery

Google Cloud BigQuery is a fully managed data warehouse and analytics platform offered by Google Cloud Platform. It simplifies the process of storing and analyzing large datasets without the need for complex infrastructure management. You can store structured and semi-structured data, such as customer records or log files in a highly scalable and distributed manner. The platform operates on a serverless model, eliminating the need for provisioning and managing infrastructure, so you can focus on analyzing data and deriving insights.

BigQuery is known for its speed and efficiency in processing queries. It utilizes a distributed architecture and parallel processing to execute queries across multiple nodes, allowing for fast analysis even with vast amounts of data. The platform automatically scales resources based on workload, ensuring scalability without capacity limitations. This makes BigQuery suitable for organizations of all sizes and you only pay for the storage and processing resources you use.

Users can leverage BigQuery’s SQL-like query language to explore, transform, and aggregate data. It supports standard SQL functions and advanced analytics features, enabling users with SQL skills to write queries for data analysis. The platform seamlessly integrates with other Google Cloud services, such as Cloud Storage and Google Sheets, allowing easy data loading and integration with visualization tools like Google Data Studio.

BigQuery prioritizes data security and governance. It provides encryption at rest and in transit, allowing you to protect your data from unauthorized access. Access controls and permissions can be set to ensure only authorized users can access and manipulate data. Additionally, BigQuery offers auditing and monitoring capabilities to track data access and changes, enhancing data security.

Google Cloud SQL

Cloud SQL is a fully managed relational database service provided by Google Cloud Platform. It allows you to easily deploy, manage, and scale popular relational databases such as MySQL, PostgreSQL, and SQL Server in the cloud. Think of Cloud SQL where you can store and organize your data in a structured manner. It takes care of the underlying infrastructure and database management tasks, so you can focus on using and querying your data without worrying about server maintenance or backups.

You can create databases with just a few clicks. It provides a user-friendly interface where you can set up your database instance, define the desired configuration, and choose the appropriate database engine. Whether you need a small database for a simple web application or a large-scale database for enterprise-level workloads, Cloud SQL can handle it.

Once your database is up and running, Cloud SQL ensures high availability and reliability. It automatically replicates your data across multiple locations, protecting it from hardware failures or other disruptions. This means your applications can access the database without interruption, and you don’t have to worry about data loss.

Cloud SQL also offers scalability. As your data grows or your application demands increase, you can easily scale up your database instance to handle the additional load. You can adjust the computing power and storage capacity of your database without any downtime, ensuring that your application performance remains consistent. Advantage of Cloud SQL is its integration with other Google Cloud services. For example, you can connect your database to Compute Engine or Kubernetes Engine to power your applications. You can also use Cloud SQL in conjunction with other Google Cloud services like BigQuery or Data Studio for advanced analytics and reporting.

Looker

Google Cloud Looker is a powerful business intelligence (BI) and data analytics platform that helps organizations explore, analyze and visualize their data in a user-friendly way. Imagine you have a treasure trove of data stored in different places like databases, spreadsheets or cloud services. Looker acts as a guide that helps you navigate through this data and uncover valuable insights. It provides a centralized platform where you can connect to various data sources and create interactive dashboards and reports.

You can easily create visually appealing visualizations such as charts, graphs and maps without any coding skills. It offers a drag-and-drop interface and a library of pre-built visualization options. You can customize these visualizations to represent your data in a way that makes sense to you and your audience. One of the standout features of Looker is its ability to transform raw data into meaningful metrics and dimensions. You can build complex data models and define relationships between different data sources, allowing you to perform advanced calculations and slice-and-dice your data to gain deeper insights.

Looker also promotes collaboration within your organization. You can share reports and dashboards with colleagues or clients, control access permissions, and even enable real-time collaboration. This encourages knowledge sharing, data-driven decision-making and ensures that everyone is on the same page when it comes to data analysis. It offers embedded analytics capabilities, allowing you to seamlessly integrate data visualizations and reports into your own applications or websites. This enables you to deliver data-driven insights directly to your customers or stakeholders without requiring them to navigate a separate tool or interface.

Summary

Google Cloud Platform (GCP) is a comprehensive suite of cloud computing services provided by Google. It offers a wide range of tools and services to help businesses build, deploy, and scale their applications and infrastructure in the cloud.

GCP provides a robust and reliable infrastructure that spans across data centers worldwide, allowing businesses to leverage the power of Google’s global network. It offers scalable computing resources, storage options, databases, machine learning services, data analytics tools and much more.

Some of the key services provided by GCP include:

  1. Compute Engine: Compute Engine offers virtual machines (VMs) in the cloud for running applications and workloads. It provides scalable and customizable computing resources.
  2. Pub/Sub: A messaging service for asynchronous communication between applications. It allows reliable and scalable messaging and event-driven architectures.
  3. Looker: A business intelligence and data analytics platform. It enables data exploration, visualization and collaboration for insights-driven decision-making.
  4. Cloud Dataflow: A fully managed service for executing batch and stream data processing pipelines. It enables data extraction, transformation and loading (ETL) operations at scale.
  5. Cloud Storage: A scalable and durable object storage service for storing and accessing data. It supports unstructured and structured data for building data lakes or archival storage.
  6. Cloud IAM (Identity and Access Management): It manages user access and permissions to GCP resources and ensures secure authentication and authorization for users and services.
  7. BigQuery: A fully managed, serverless data warehouse for analyzing large datasets. It offers fast querying and scalability for advanced analytics and data exploration.
  8. Cloud SQL: A fully managed relational database service. It supports popular databases like MySQL, PostgreSQL, and SQL Server in the cloud.

These are just a few examples of the services offered by GCP. The platform provides a wide array of tools and services to meet various business needs, including infrastructure management, storage, databases, analytics, machine learning and more. GCP is known for its reliability, security, scalability, and performance. It offers a flexible pricing model that allows businesses to pay for what they use and scale their resources as needed. With its global reach and extensive set of services, GCP empowers businesses to innovate, scale, and succeed in the cloud.

--

--

Muhammad Fahad
Muhammad Fahad

Written by Muhammad Fahad

Enthusiastic tech aficionado with a relentless drive to explore new trends and technologies / Data Engineer | Cloud | Data Warehouse

No responses yet