
GCP
335+ free practice questions with AI-verified answers
Powered by AI
Every Google Professional Machine Learning Engineer answer is cross-checked by 3 leading AI models to ensure maximum accuracy. Get detailed per-option explanations and an in-depth analysis of every question.
You deployed a TensorFlow recommendation model to a Vertex AI Prediction endpoint in us-central1 with autoscaling enabled. Over the last week, you observed sustained traffic of ~1,200 requests per minute (about 20 RPS) during business hours, which is 2x higher than your original estimate, and you need to keep P95 latency under 150 ms during future surges. You want the endpoint to scale efficiently to handle this higher baseline and upcoming spikes without causing user-visible latency. What should you do?
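When reasoning about questions like this, a back-of-envelope replica-sizing calculation helps. The sketch below is illustrative only: the per-replica throughput and headroom factor are assumed numbers, not measured Vertex AI figures.

```python
# Back-of-envelope replica sizing for an autoscaled prediction endpoint.
# rps_per_replica and headroom are assumptions for illustration, not
# measured Vertex AI values.
import math

def min_replicas_needed(peak_rps: float, rps_per_replica: float,
                        headroom: float = 0.7) -> int:
    """Replicas needed to keep each replica below `headroom` utilization
    at peak, so latency does not degrade before autoscaling catches up."""
    return math.ceil(peak_rps / (rps_per_replica * headroom))

# Assume each replica sustains 15 RPS before P95 latency climbs.
baseline_min = min_replicas_needed(peak_rps=20, rps_per_replica=15)  # 2
surge_min = min_replicas_needed(peak_rps=60, rps_per_replica=15)     # 6
```

The point of the headroom factor is that autoscaling reacts with a lag, so the minimum replica count should absorb short spikes on its own.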
You plan to fine-tune a video-frame classifier via transfer learning using a pre-trained ResNet-50 backbone. Your labeled dataset contains 18,000 1080p frames, and you will retrain the model once per day; each training run completes in under 60 minutes on 4 V100 GPUs, and you must minimize infrastructure cost and operational overhead. Which platform components and configuration should you choose?
Your team is preparing to train a fraud detection model using data in BigQuery that includes several fields containing PII (for example, card_number, customer_email, and phone_number). The dataset has approximately 250 million rows and every column is required as a feature. Security requires that you reduce the sensitivity of PII before training while preserving each column’s format and length so downstream SQL joins and validations continue to work. The transformation must be deterministic so the same input always maps to the same protected value, and authorized teams must be able to decrypt values for audits. How should you proceed?
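The requirements in this scenario (deterministic, format- and length-preserving, reversible for authorized audits) are exactly what Cloud DLP's deterministic encryption with format-preserving encryption (FPE-FFX) provides. The toy below is NOT cryptographically secure and is not the DLP algorithm; it only illustrates what those three properties mean, using hypothetical `protect`/`reveal` helpers.

```python
# Toy deterministic, reversible, format-preserving transform for digit
# strings. NOT secure -- in production use Cloud DLP deterministic
# encryption / FPE-FFX. Shown only to illustrate the required properties.
import hashlib
import hmac

def _shifts(key: bytes, n: int) -> list:
    # Derive one digit shift per position from an HMAC keystream.
    return [hmac.new(key, i.to_bytes(4, "big"), hashlib.sha256).digest()[0] % 10
            for i in range(n)]

def protect(digits: str, key: bytes) -> str:
    s = _shifts(key, len(digits))
    return "".join(str((int(d) + k) % 10) for d, k in zip(digits, s))

def reveal(protected: str, key: bytes) -> str:
    s = _shifts(key, len(protected))
    return "".join(str((int(d) - k) % 10) for d, k in zip(protected, s))

key = b"audit-team-key"
token = protect("4111111111111111", key)
assert len(token) == 16 and token.isdigit()       # format & length preserved
assert token == protect("4111111111111111", key)  # deterministic
assert reveal(token, key) == "4111111111111111"   # reversible for audits
```

Because the output keeps the same length and character class, downstream SQL joins and format validations keep working on the protected values.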
Your team deployed a regression model that predicts hourly water usage for industrial chillers. Four months after launch, a vendor firmware update changed sensor sampling and units for three input features, and the live feature distributions diverged: 5 of 18 features now have a population stability index > 0.25, 27% of temperature readings fall outside the training range, and production RMSE increased from 0.62 to 1.45. How should you address the input differences in production?
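The population stability index cited in the scenario is straightforward to compute. A minimal sketch, using made-up bin proportions, shows how PSI quantifies the shift between the training (expected) and production (actual) distribution of one feature:

```python
# Population Stability Index over shared bins; values above 0.25 are the
# conventional "significant shift" threshold referenced in the scenario.
import math

def psi(expected, actual, eps=1e-6):
    """Both inputs are per-bin proportions that each sum to 1."""
    return sum((a - e) * math.log((a + eps) / (e + eps))
               for e, a in zip(expected, actual))

train_bins = [0.25, 0.25, 0.25, 0.25]   # training-time distribution
prod_bins = [0.05, 0.10, 0.35, 0.50]    # post-firmware-update distribution

shift = psi(train_bins, prod_bins)      # well above the 0.25 alert threshold
```

A PSI of 0 means identical distributions; the firmware-induced unit change in the scenario would push several features past 0.25, as observed.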
You are a data scientist at a city transportation agency tasked with forecasting hourly bike-share demand per station to optimize rebalancing. Your historical trips table in BigQuery contains 24 months of data (~22 million rows) with columns: timestamp, station_id, neighborhood, weather_condition (sunny/rainy/snow), special_event (boolean), and surge_pricing_flag (boolean). You need to choose the most effective combination of a BigQuery ML model and feature engineering to minimize RMSE while capturing weekly/seasonal patterns and handling multiple categorical variables; what should you do?
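Whatever BigQuery ML model is chosen, the feature engineering half of the question comes down to extracting cyclical time features and encoding the categorical columns. A minimal Python sketch (column names taken from the scenario; the encoding itself is illustrative):

```python
# Feature-engineering sketch for hourly demand forecasting: cyclical time
# features plus one-hot encoding of the scenario's categorical columns.
from datetime import datetime

WEATHER = ["sunny", "rainy", "snow"]

def make_features(ts: str, weather: str, special_event: bool,
                  surge: bool) -> dict:
    dt = datetime.fromisoformat(ts)
    feats = {
        "hour_of_day": dt.hour,        # daily seasonality
        "day_of_week": dt.weekday(),   # weekly seasonality
        "month": dt.month,             # yearly seasonality
        "special_event": int(special_event),
        "surge_pricing_flag": int(surge),
    }
    for w in WEATHER:                  # one-hot weather_condition
        feats[f"weather_{w}"] = int(weather == w)
    return feats

row = make_features("2024-07-06T17:30:00", "rainy", False, True)
```

In BigQuery ML the same derivations would typically live in a TRANSFORM clause so they are applied identically at training and prediction time.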
Want to practice every question anywhere?
Download Cloud Pass for free; it includes practice exams, progress tracking, and more.
You work for a vacation rental marketplace with 1.8 million property listings stored across BigQuery and Cloud Storage; the current search relies on keyword matching and filter chips, but you are seeing more complex semantic queries that reference amenities and metadata (for example, "quiet pet-friendly cabin near a lake with a fireplace, sleeps 6, under $200/night, host rating > 4.7"). You must deliver a revamped semantic search proof of concept within 2 weeks with minimal custom modeling and integration effort that can quickly index both structured listing attributes and unstructured descriptions; what should you choose as the search backend?
You are a data scientist at a national power utility analyzing 850 million smart-meter readings from 3,000 substations collected over 5 years; for exploratory analysis, you must compute descriptive statistics (mean, median, mode) by device and region, perform complex hypothesis tests (e.g., differences between peak vs off-peak and seasonal periods with multiple comparisons), and plot feature variations at hourly and daily granularity over time, while using as much of the telemetry as possible and minimizing computational resources—what should you do?
You are launching a grocery delivery mobile app across 3 cities and will use Google Cloud's Recommendations AI to build, test, and deploy product suggestions; you currently capture about 2.5 million user events per day, maintain a catalog of 120,000 SKUs with accurate price and availability, and your business objective is to raise average order value (AOV) by at least 6% within the next quarter while adhering to best practices. Which approach should you take to develop recommendations that most directly increase revenue under these constraints?
You are training a LightGBM model to forecast daily inventory for 120 stores using a small dataset (~60 MB) on Vertex AI; your training script needs a system library (libgomp) and several custom Python packages, and each run takes about 10 minutes, so you want job startup time to be under 2 minutes to minimize overhead. How should you configure the Vertex AI custom training job to minimize startup time while keeping the dataset easy to update?
You are building an end-to-end scikit-learn MLOps workflow in Vertex AI Pipelines (Kubeflow Pipelines) that ingests 50 GB of CSV data from Cloud Storage, performs data cleaning, feature selection, model training, and model evaluation, then writes a .pkl model artifact to a versioned path in a GCS bucket. You are iterating on multiple versions of the feature selection and training components, submitting each version as a new pipeline run in us-central1 on n1-standard-4 CPU-only executors; each end-to-end run currently takes about 80 minutes. You want to reduce iteration time during development without increasing your GCP costs; what should you do?
Your team must deliver an ML solution on Google Cloud to triage warranty claim emails for a global appliance manufacturer into 8 categories within 4 weeks. You are required to use TensorFlow to maintain full control over the model's code, serving, and deployment, and you will orchestrate the workflow with Kubeflow Pipelines. You have 30,000 labeled examples and want to accelerate delivery by leveraging existing resources and managed services instead of training a brand-new model from scratch. How should you build the classifier?
You are building an anomaly detection model for an industrial IoT platform using Keras and TensorFlow. The last 24 months of sensor events (~900 million rows, ~2.6 TB) are stored in a single partitioned table in BigQuery, and you need to apply feature scaling, categorical encoding, and time-window aggregations in a cost-effective and efficient way before training. The trained model will be used to run weekly batch inference directly in BigQuery against newly ingested partitions. How should you implement the preprocessing workflow?
Your edtech company operates a live Q&A chat in virtual classrooms, where an automated text moderation model flags toxic messages. After recent complaints, you discover that benign messages referencing certain indigenous festivals are being misclassified as abusive; an audit on a 10,000-message holdout shows a 12–15% false positive rate for messages containing those festival names versus 3% overall, and those references make up <1% of your training set. With a tight budget and an overextended team this quarter, a major overhaul or full replacement is not feasible; what should you do?
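The audit finding in this scenario is a per-slice false positive rate comparison. A toy sketch of that sliced evaluation (labels and counts below are made up, chosen only to mirror the 12-15% vs 3% pattern):

```python
# Sliced evaluation: false positive rate on messages containing the
# affected festival terms versus the rest of the holdout. Toy data.
def false_positive_rate(rows):
    """rows: (is_toxic_label, flagged_by_model) pairs."""
    benign_flags = [flagged for toxic, flagged in rows if not toxic]
    return sum(benign_flags) / len(benign_flags) if benign_flags else 0.0

# (label, prediction) pairs; all benign, first group mentions the terms.
festival_rows = [(False, True)] * 2 + [(False, False)] * 6
other_rows = [(False, True)] * 1 + [(False, False)] * 29

fpr_festival = false_positive_rate(festival_rows)  # 2/8 = 0.25
fpr_other = false_positive_rate(other_rows)        # 1/30 ~ 0.033
```

Tracking this gap per slice is what lets a low-budget fix (for example, targeted data augmentation for the underrepresented terms) be verified without a full model overhaul.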
A fintech analytics team has migrated 12 time-series forecasting and anomaly-detection models to Google Cloud over the last 90 days and is now standardizing new training on Vertex AI. You must implement a system that automatically tracks model artifacts (datasets, feature snapshots, checkpoints, and model binaries) and end-to-end lineage across pipeline steps for dev, staging, and prod; the solution must be simple to adopt via reusable templates, require minimal custom code, retain lineage for at least 180 days, and scale to future models without re-architecting; what should you do?
You manage the ML engineering team for a regional logistics network; most training runs are multi-node PyTorch Lightning jobs on managed training with NVIDIA T4 GPUs where a single experiment consumes ~3,000 GPU-hours, new model versions are released every 6–10 weeks, and finance requires at least a 40% reduction in training compute spend without degrading model quality or materially increasing wall-clock time; your pipeline already writes restartable checkpoints to Cloud Storage every 10 minutes with <2% overhead and can tolerate node interruptions. What should you do to reduce Google Cloud compute costs without impacting the model’s performance?
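Since the pipeline already checkpoints and tolerates interruptions, the cost lever here is preemptible capacity. Google publishes a 60-91% discount range for Spot VMs; the specific discount and preemption-retry overhead below are assumptions for illustration:

```python
# Rough cost arithmetic for moving checkpointed training to Spot VMs.
# The 60% discount and 5% redone-work overhead are assumed figures.
def spot_savings(gpu_hours: float, on_demand_rate: float,
                 spot_discount: float, retry_overhead: float) -> float:
    """Fractional cost reduction after accounting for work redone
    following preemptions (checkpoint-to-checkpoint replay)."""
    on_demand_cost = gpu_hours * on_demand_rate
    spot_cost = (gpu_hours * (1 + retry_overhead)
                 * on_demand_rate * (1 - spot_discount))
    return 1 - spot_cost / on_demand_cost

# 3,000 GPU-hours per experiment, assumed 60% Spot discount, 5% replay.
reduction = spot_savings(3000, on_demand_rate=0.35,
                         spot_discount=0.60, retry_overhead=0.05)  # 0.58
```

Note the hourly rate cancels out: with a 60% discount and 5% replay overhead, the reduction is 58%, comfortably above the 40% finance target, and the 10-minute checkpoint cadence bounds the replay overhead.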
Your robotics team is deploying a quality inspection system for ceramic floor tiles on a high-speed conveyor line; each tile is captured as a 3840x3840 RGB image under controlled lighting, and you have 60,000 labeled images (defective vs. non-defective); operations managers require pixel-level attribution heatmaps overlaid on each image so they can pinpoint hairline cracks and decide whether to discard a tile within the same shift. How should you build the model?
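The pixel-level attribution heatmaps this scenario asks for are typically produced with integrated gradients (e.g., via Vertex Explainable AI). As a minimal illustration of the idea, for a linear scorer f(x) = w . x, integrated gradients from a zero baseline reduce exactly to w_i * x_i per feature; the toy below is not a CNN explainer, just the mechanics:

```python
# Toy integrated-gradients attribution on a linear scorer. The gradient
# of w.x is w everywhere, so the path integral is exact here.
def integrated_gradients_linear(w, x, steps=50):
    """Approximate IG with a Riemann sum along the baseline->x path."""
    baseline = [0.0] * len(x)
    attributions = []
    for i in range(len(x)):
        # average gradient along the path, times (x_i - baseline_i)
        avg_grad = sum(w[i] for _ in range(steps)) / steps
        attributions.append((x[i] - baseline[i]) * avg_grad)
    return attributions

w = [0.5, -2.0, 1.0]   # per-pixel weights (stand-in for a model)
x = [4.0, 1.0, 3.0]    # input "pixels"
attr = integrated_gradients_linear(w, x)       # [2.0, -2.0, 3.0]
score = sum(wi * xi for wi, xi in zip(w, x))   # 3.0
assert abs(sum(attr) - score) < 1e-9           # completeness axiom
```

The completeness check at the end is the property that makes such heatmaps trustworthy: attributions sum to the model's score relative to the baseline.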
A digital payments startup trained a binary classification model on Vertex AI to flag potentially fraudulent card transactions using 24 months of historical data (validation AUC = 0.93) and deployed it to a Vertex AI online endpoint that processes ~60,000 requests per day; after 4 weeks, the production AUC computed from feedback labels has dropped to 0.76, while autoscaling shows sufficient replicas and Cloud Monitoring reports P95 latency around 110 ms and error rate < 0.1%. What should you do first to troubleshoot the drop in predictive performance?
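Recomputing production AUC from feedback labels needs no special tooling; the rank (Mann-Whitney) formulation is a few lines of dependency-free Python. A minimal sketch on toy labels and scores:

```python
# Pure-Python ROC AUC via the rank (Mann-Whitney) formulation: the
# probability that a random positive outscores a random negative.
def roc_auc(labels, scores):
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]
auc = roc_auc(labels, scores)   # 8/9
```

With serving infrastructure healthy (latency and error rate nominal), an AUC drop of this size usually points to data issues, so the first troubleshooting step is comparing serving feature distributions against the training baseline rather than touching the deployment.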
You are part of a data science team at a ride‑sharing platform and need to train and compare multiple TensorFlow models on Vertex AI using 850 million labeled trip records (≈2.3 TB) stored in a BigQuery table; training will run on 4–8 workers and you want to minimize data‑ingestion bottlenecks while ensuring the pipeline remains scalable and repeatable. What should you do?
Your e-commerce price-optimization model serves about 30,000 predictions per hour on a Vertex AI endpoint, and a Vertex AI Model Monitoring job is configured to detect training-serving skew using a 24-hour sliding window with a 0.3 sampling rate and a baseline dataset at gs://retail-ml/training/2025-06/data.parquet; after three consecutive windows reporting skew on features inventory_days and competitor_price, you retrained the model using the last 45 days of data at gs://retail-ml/training/2025-08/data.parquet and deployed version v2 to the same endpoint, but the monitoring job still raises the same skew alert—what should you do?
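A plausible reading of this scenario is that the monitoring job is still comparing serving traffic against the stale 2025-06 baseline after the v2 deployment. A toy divergence check between binned distributions (Jensen-Shannon divergence here; thresholds and bin values are illustrative, not the monitoring service's internals) shows why a stale baseline keeps alerting:

```python
# Jensen-Shannon divergence between binned feature distributions; a stale
# baseline keeps reporting skew even when serving matches the new
# training data. Bin proportions below are made up.
import math

def js_divergence(p, q, eps=1e-12):
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    def kl(a, b):
        return sum(ai * math.log((ai + eps) / (bi + eps))
                   for ai, bi in zip(a, b))
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

old_baseline = [0.40, 0.35, 0.25]   # stale 2025-06 training bins
new_baseline = [0.20, 0.30, 0.50]   # 2025-08 retraining bins
serving      = [0.21, 0.29, 0.50]   # current serving traffic

assert js_divergence(old_baseline, serving) > js_divergence(new_baseline, serving)
```

Serving traffic that matches the 45-day retraining window still diverges sharply from the old baseline, which is consistent with the persistent alert.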
You trained an automated scholarship eligibility classifier for a national education nonprofit using Vertex AI on 1.2 million labeled applications, reaching an offline ROC AUC of 0.95; the review board is concerned that predictions may be biased by applicant demographics (e.g., gender, ZIP-code–derived income bracket, first-generation college status) and asks you to deliver transparent insight into how the model makes decisions for 500 sampled approvals and denials and to identify any fairness issues across these cohorts. What should you do?
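One concrete fairness check behind this scenario is comparing approval rates across sensitive cohorts (demographic parity). A toy sketch with made-up predictions, to show the metric itself:

```python
# Demographic parity gap: the spread in positive-prediction (approval)
# rates across groups of a sensitive attribute. Toy predictions.
def approval_rate(preds):
    return sum(preds) / len(preds)

def parity_gap(preds_by_group):
    rates = [approval_rate(p) for p in preds_by_group.values()]
    return max(rates) - min(rates)

preds = {
    "first_gen":     [1, 0, 0, 1, 0, 0, 0, 0, 0, 0],  # 20% approved
    "non_first_gen": [1, 1, 0, 1, 0, 1, 0, 1, 0, 0],  # 50% approved
}
gap = parity_gap(preds)   # 0.30
```

In practice this per-cohort slicing would be paired with per-example feature attributions (e.g., Vertex Explainable AI) on the 500 sampled decisions to show which features drove each approval or denial.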
Study period: 1 month
Just want to say a massive thank you to the entire Cloud Pass team for helping me pass my exam first time. I won't lie, it wasn't easy, especially the way the real exam is worded; however, the way the practice questions teach you why your option was wrong really helps to frame your mind, understand what the question is asking for, and see which solutions you should be focusing on. Thanks once again.
Study period: 1 month
Good question banks and explanations that helped me practise and pass the exam.
Study period: 1 month
I took the lectures and then went straight into the practice questions; I was getting around 80% correct, and I passed the exam with a high score. The app served me well.
Study period: 1 month
Good mix of theory and practical scenarios
Study period: 1 month
I used the app mainly to review the fundamentals—data preparation, model tuning, and deployment options on GCP. The explanations were simple and to the point, which really helped before the exam.

Get the app for free