The purpose of this Sample Question Set is to provide you with information about the Databricks Certified Generative AI Engineer Associate exam. These sample questions will make you very familiar with both the type and the difficulty level of the questions on the Generative AI Engineer Associate certification test. To get familiar with real exam environment, we suggest you try our Sample Databricks Generative AI Engineer Associate Certification Practice Exam. This sample practice exam gives you the feeling of reality and is a clue to the questions asked in the actual Databricks Certified Generative AI Engineer Associate certification exam.
These sample questions are simple and basic questions that represent likeness to the real Databricks Certified Generative AI Engineer Associate exam questions. To assess your readiness and performance with real-time scenario based questions, we suggest you prepare with our Premium Databricks Generative AI Engineer Associate Certification Practice Exam. When you solve real time scenario based questions practically, you come across many difficulties that give you an opportunity to improve.
Databricks Generative AI Engineer Associate Sample Questions:
01. Which steps are essential when writing chunked text into Delta Lake tables in Unity Catalog?
(Select TWO)
a) Writing all chunks as a single file
b) Partitioning data based on categories
c) Avoiding partitions for simplicity
d) Structuring chunks to support efficient querying
02. A Generative AI Engineer is loading 150 million embeddings into a vector database that takes a maximum of 100 million. Which TWO actions can they take to reduce the record count?
a) Increase the document chunk size
b) Decrease the overlap between chunks
c) Decrease the document chunk size
d) Increase the overlap between chunks
e) Use a smaller embedding model
03. A Generative AI Engineer would like to build an application that can update a memo field that is about a paragraph long to just a single sentence gist that shows intent of the memo field, but fits into their application front end.
With which Natural Language Processing task category should they evaluate potential LLMs for this application?
a) text2text Generation
b) Sentencizer
c) Text Classification
d) Summarization
04. A Generative AI Engineer is creating a LLM-based application. The documents for its retriever have been chunked to a maximum of 512 tokens each. The Generative AI Engineer knows that cost and latency are more important than quality for this application. They have several context length levels to choose from.
Which will fulfill their need?
a) context length 512: smallest model is 0.13GB with and embedding dimension 384
b) context length 514: smallest model is 0.44GB and embedding dimension 768
c) context length 2048: smallest model is 11GB and embedding dimension 2560
d) context length 32768: smallest model is 14GB and embedding dimension 4096
05. A Generative AI Engineer is building a RAG application that will rely on context retrieved from source documents that have been scanned and saved as image files in formats like .jpeg or .png. They want to develop a solution using the least amount of lines of code.
Which Python package should be used to extract the text from the source documents?
a) beautifulsoup
b) scrapy
c) pytesseract
d) pyquery
06. Which of the following are essential components when designing a prompt for a Generative AI model?
(Select TWO)
a) Clear intent
b) Multiple ambiguous instructions
c) Examples of expected output
d) Irrelevant details to increase model flexibility
07. Which of the following strategies would best help in designing a prompt that leads to a well-formatted response in an LLM?
(Select TWO)
a) Provide vague instructions for flexibility
b) Use structured steps for formatting
c) Use simple language with clear directives
d) Incorporate multiple unrelated tasks in a single prompt
08. Which of the following are key considerations when identifying source documents for a RAG application?
(Select TWO)
a) Information density
b) Document size
c) Document format
d) Document relevance to the task
09. A Generative AI Engineer is assessing the responses from a customer-facing GenAI application that they are developing to assist in selling automotive parts. The application requires the customer to explicitly input account_id and transaction_id to answer questions.
After initial launch, the customer feedback was that the application did well on answering order and billing details, but failed to accurately answer shipping and expected arrival date questions.
Which of the following receivers would improve the application's ability to answer these questions?
a) Create a vector store that includes the company shipping policies and payment terms for all automotive parts
b) Create a feature store table with transaction_id as primary key that is populated with invoice data and expected delivery date
c) Provide examples data for expected arrival dates as a tuning dataset, then periodically fine-tune the model so that it has updated shipping information
d) Amend the chat prompt to input when the ordered was placed and instruct the model to add 14 days to that as no shipping method is expected to exceed 14 days
10. Which of the following is the most effective way to design a prompt that elicits a specific format in the response?
a) Provide clear examples of the desired format
b) Use short and ambiguous instructions
c) Avoid specifying any format in the prompt
d) Use multiple tasks in a single prompt
Answers:
Question: 01 Answer: b, d |
Question: 02 Answer: a, b |
Question: 03 Answer: d |
Question: 04 Answer: a |
Question: 05 Answer: c |
Question: 06 Answer: a, c |
Question: 07 Answer: b, c |
Question: 08 Answer: a, d |
Question: 09 Answer: b |
Question: 10 Answer: a |
Note: For any error in Databricks Certified Generative AI Engineer Associate certification exam sample questions, please update us by writing an email on feedback@certfun.com.