Google Cloud Platform

Amropam Tech https://amropamtech.blogspot.com/p/gcp.html

Google Cloud Platform (GCP), offered by Google, is a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products, such as Google Search, Gmail, Google Drive, and YouTube. Alongside a set of management tools, it provides a series of modular cloud services including computing, data storage, data analytics, and machine learning.

~ from Wikipedia

Google Cloud Platform Offerings: 

  • Compute
    • Bare Metal
      • Infrastructure for running specialized workloads on Google Cloud
    • Compute Engine / VM Instance
      • Virtual machines running in Google's data centers
    • App Engine
      • Serverless application platform for apps and backends
      • Use: web application platform
      • Fully managed; you supply only the code
      • 2 Modes
        • Standard
          • Supports a limited set of languages
          • Runs only specific runtime and library versions
          • Scales to zero instances when there is no incoming traffic
        • Flexible
          • Custom runtime environment
          • Requires a container runtime configuration
          • Cannot scale to zero; at least one instance keeps running
    • Google Kubernetes Engine
      • Managed environment for running containerized apps
      • Based on Kubernetes, the container orchestration system
      • 2 Modes
        • Standard (manual node configuration)
        • Autopilot
          • Google manages node configuration and scaling
    • Cloud Run
      • Fully managed environment for running containerized apps 
    • Cloud Functions
      • Serverless service for running functions
      • Event-driven
        • A function can be triggered by an event
    • Preemptible VMs
      • Cost-saving VMs for stateless computing
      • Limited lifetime: run for at most 24 hours and can be reclaimed at any time
    • Spot VMs
      • Compute instances for batch jobs and fault-tolerant workloads
      • Saves up to 91% of costs
      • Suited to stateless containers
    • Shielded VMs 
      • Boot-level security (Secure Boot, vTPM, integrity monitoring)
    • Instance Group
      • Can serve as a load balancer backend
      • 2 Modes
        • Managed Instance Groups
          • Support autoscaling
        • Unmanaged Instance Groups
          • No autoscaling
    • VMware Engine
      • Migrate and run on-premises VMware workloads natively on Google Cloud 
  • Storage Classes (Cloud Storage)
    • Standard
      • For frequently accessed data
      • No retrieval fee
      • Highest storage cost of the four classes
    • Nearline
      • For data accessed about once a month
      • Lower storage cost than Standard
        • But a higher access cost than Standard
    • Coldline
      • For data accessed about once a quarter
      • Lower storage cost than Nearline
        • But a higher access cost than Nearline
    • Archive
      • For data accessed about once a year
      • Lowest storage cost
        • But a higher access cost than Coldline
  • Storage
    • Cloud Storage
      • Object storage that is secure, durable, and scalable
      • Commonly used as a data lake
      • Also stores unstructured and semi-structured data
    • Filestore
      • File storage that is highly scalable and secure
      • Used as a shared network file system (NFS) for compute instances
    • Local SSD
      • Block storage that is locally attached for higher-performance needs
    • Persistent Disks
      • Block storage for VMs running on Google Cloud
      • Retains data even after the VM is stopped
      • In read-write mode, attaches to a single VM instance at a time
  • Database
    • Bare Metal 
      • Infrastructure for running specialized workloads on Google Cloud
    • Cloud SQL
      • Fully managed database service for MySQL, PostgreSQL, and SQL Server
      • Supports relational databases
      • Suited to datasets up to roughly 10 TB
    • Cloud Spanner
      • Cloud-native relational database with horizontal scaling and up to 99.999% availability
      • Strongly consistent transactions
      • Distributes data and processing across a cluster
    • Bigtable
      • Cloud-native wide-column database for large-scale, low-latency workloads
      • NoSQL database
      • Very low latency for I/O operations
      • Wide-column storage
        • Each cell is addressed by row key, column family, column qualifier, and timestamp (four dimensions)
      • Offers an HBase-compatible API (HBase was modeled on Bigtable, not the reverse)
    • Firestore 
      • Cloud-native document database for building rich mobile, web, and IoT apps
      • Automatic data synchronization across clients
      • Used as a database for mobile apps
    • Firebase Realtime Database
      • NoSQL database for storing and syncing data in real time
    • Memorystore
      • Managed in-memory data store for Redis and Memcached
      • Used for caching and leaderboards
  • Analytics
    • BigQuery
      • Data warehouse for agility and insights
      • Can process petabytes of data
      • Serverless and cost-effective
      • 2 Types of data storage
        • Active
        • Long-term (tables not modified for 90 consecutive days)
      • 2 Pricing models
        • Capacity-based (slots)
        • On-demand (billed by the amount of data processed)
      • Google Cloud's flagship analytics product
    • Dataproc
      • Service for running Apache Spark and Apache Hadoop clusters
      • Fully managed Hadoop service
      • Used for ETL and Spark jobs
      • Data lake modernization tool
      • Rapid cluster creation
      • Supports lift-and-shift migration of existing Hadoop and Spark workloads
    • Dataflow
      • Streaming analytics for stream and batch processing
      • Fully managed service for data processing
      • Serverless and cost-effective
      • Based on Apache Beam
      • Unified stream and batch data processing
      • Pipelines are defined in code
    • Data Fusion 
      • Data integration service for building and managing data pipelines
      • Builds data pipelines without coding
      • Visual pipeline designer
    • Dataprep
      • Service to prepare data for analytics and machine learning
      • Used for transforming data before loading
    • Cloud Composer
      • Workflow orchestration service built on Apache Airflow
      • Used for end-to-end pipeline orchestration
      • Airflow as a managed service
    • Data Catalog
      • Metadata service for exploring and managing data 
    • Dataplex 
      • Intelligent data management across distributed data silos 
  • Business Intelligence 
    • Looker 
      • Platform for BI, data application, and embedded analytics 
      • Improves productivity and decision-making by delivering insights to more users
      • Embedded analytics product
  • Data Ingestion 
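    • Pub/Sub
      • Managed service for event ingestion and delivery
      • Asynchronous messaging service
      • Global message routing
      • Fills a role similar to Apache Kafka, but is a Google-built service, not based on Kafka
    • IoT Core
      • Used to capture data from IoT devices (Google retired the service in August 2023)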
  • Networking
    • Cloud Armor
      • Security policies and defense against web and DDoS attacks 
    • Cloud CDN
      • Content delivery network for serving web and video content 
    • Cloud Domains
      • Easy domain name registration and management 
    • Cloud DNS
      • Domain name system for reliable and low-latency name lookups 
    • Cloud IDS
      • Cloud Intrusion Detection System: cloud-native network threat detection
    • Cloud Load Balancing 
      • Service for distributing traffic across applications and regions 
    • Virtual Private Cloud (VPC)
      • Virtual network for cloud resources and cloud-based services 
    • Connecting On-premises with GCP
      • Cloud VPN
        • Secure, encrypted connection to on-premises networks
        • Up to 3 Gbps per tunnel using IPsec
      • Cloud Interconnect
        • Partner Interconnect
          • 50 Mbps to 10 Gbps
        • Dedicated Interconnect
          • 10 Gbps or 100 Gbps circuits
          • Requires colocation with Google at a supported facility
      • Peering with Google
        • Reduces egress fees
        • Access to Google's public services and Google Workspace at scale
        • 2 Types
          • Direct Peering
          • Carrier Peering
    • NAT Gateway (Cloud NAT)
      • NAT service that gives private instances outbound internet access
    • Network Intelligence Center
      • Network monitoring, verification, and optimization platform
  • DevOps CI/CD
  • Monitoring
    • Cloud Monitoring
      • Single-pane view of infrastructure and application health with rich metrics
    • Cloud Logging
      • Cloud audit, platform, and application logs management 
    • Cloud Debugger
      • Real-time application state inspection and in-production debugging
    • Cloud Profiler
      • CPU and heap profiler for analyzing application performance 
    • Cloud Trace 
      • Tracing system collecting latency data from applications 
    • Cloud Error Reporting
      • Real-time exception monitoring and alerting
  • AI/ML
    • Vision API
      • Image Analysis 
    • Natural Language 
      • Natural language processing (NLP) for text analysis
    • Recommendation API
      • Used for customer recommendation systems
    • Translation API
      • Translates text from one language to another
    • Speech to Text
      • Converts speech to text
    • Video Intelligence
      • Used for
        • Video analysis
        • Live feed analysis
      • Can detect dangerous behavior in live video streams
  • More to come...
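
The storage class trade-off listed above (colder classes cost less to store but more to access) can be turned into a simple rule of thumb. The sketch below is only an illustration: the function name and the day thresholds (30, 90, 365) are assumptions derived from the "once a month / quarter / year" guidance in the list, and it ignores real pricing entirely.

```python
# Illustrative thresholds, derived from the access patterns in the list above.
# Ordered from coldest to warmest so the first match wins.
THRESHOLDS = [
    (365, "ARCHIVE"),   # read about once a year or less
    (90, "COLDLINE"),   # read about once a quarter
    (30, "NEARLINE"),   # read about once a month
    (0, "STANDARD"),    # read frequently
]


def suggest_storage_class(days_between_reads: int) -> str:
    """Suggest a storage class from the expected gap between reads (hypothetical helper)."""
    if days_between_reads < 0:
        raise ValueError("days_between_reads must be non-negative")
    for min_days, storage_class in THRESHOLDS:
        if days_between_reads >= min_days:
            return storage_class
    return "STANDARD"  # not reached: the last threshold is 0


for days in (1, 45, 120, 400):
    print(days, "->", suggest_storage_class(days))
```

In practice the choice should also weigh object size, retrieval volume, and current prices, which this sketch leaves out.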
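
To make the asynchronous messaging idea behind Pub/Sub (listed above under Data Ingestion) more concrete, here is a toy in-memory model. It is not the Pub/Sub client library: the `Topic` class and its methods are hypothetical names invented for this sketch. It shows only the core pattern, in which a publisher writes to a topic without knowing who reads, and every subscription receives its own copy of each message.

```python
from collections import deque
from typing import Optional


class Topic:
    """Toy in-memory topic (hypothetical, not the real Pub/Sub API)."""

    def __init__(self) -> None:
        # One independent queue per subscription name.
        self._queues = {}

    def subscribe(self, subscription: str) -> None:
        # Create the subscription's queue if it does not exist yet.
        self._queues.setdefault(subscription, deque())

    def publish(self, message: str) -> int:
        # Fan out: append a copy of the message to every subscription.
        for queue in self._queues.values():
            queue.append(message)
        return len(self._queues)

    def pull(self, subscription: str) -> Optional[str]:
        # Return the oldest undelivered message, or None if the queue is empty.
        queue = self._queues[subscription]
        return queue.popleft() if queue else None


topic = Topic()
topic.subscribe("billing")
topic.subscribe("audit")
delivered = topic.publish("order-created")

print(delivered)              # number of subscriptions that received the message
print(topic.pull("billing"))  # each subscription gets its own copy
print(topic.pull("audit"))
```

The real service adds durability, acknowledgements, retries, and push delivery, none of which this sketch models.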

