The Google Generative AI integration adds a conversation agent powered by Google Generative AI to Home Assistant.
This conversation agent cannot control your house. It can be used in automations, but not as a sentence trigger, and it can only query information that Home Assistant has provided. To answer questions about your house, Home Assistant needs to provide Google Generative AI with details of your house, including areas, devices, and their states.
This integration requires an API key. The steps for generating one are described below.
To add the Google Generative AI Conversation integration to your Home Assistant instance, use this My button:
If the above My button doesn’t work, you can also perform the following steps manually:
The Google Generative AI API key is used to authenticate requests to the Google Generative AI API. To generate an API key, take the following steps:
- Visit the API Keys page to retrieve the API key you’ll use to configure the integration.
Options for Google Generative AI Conversation can be set via the user interface, by taking the following steps:
- Browse to your Home Assistant instance.
- Go to Settings > Devices & Services.
- If multiple instances of Google Generative AI Conversation are configured, choose the instance you want to configure.
- Select the integration, then select Configure.
The starting text for the AI language model to generate new text from. This text can include information about your Home Assistant instance, devices, and areas, and is written using Home Assistant Templating.
Creativity allowed in the responses. Higher values produce a more random and varied response. A temperature of zero will be deterministic.
Allows you to ask Gemini Pro or Gemini Pro Vision to generate content from a prompt consisting of text and optionally images. This service populates response data with the generated content.
Service data attributes:

|Description|Example|
|---|---|
|The prompt for generating the content.|Describe this image|
|File names for images to include in the prompt.| |
Very briefly describe what you see in this image from my doorbell camera.
Your message needs to be short to fit in a phone notification. Don't
describe stationary objects or buildings.
The response data field `text` will contain the generated content.
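As a rough sketch, the doorbell prompt above might be sent from a script or automation action sequence like the one below. The service name `google_generative_ai_conversation.generate_content`, the attribute names `prompt` and `image_filename`, the snapshot path, and the `notify.mobile_app_phone` notifier are assumptions made for illustration and are not confirmed by this page, so adjust them to match your installation.

```yaml
# Sketch only: the service and attribute names below are assumptions.
- service: google_generative_ai_conversation.generate_content
  data:
    prompt: >-
      Very briefly describe what you see in this image from my doorbell camera.
      Your message needs to be short to fit in a phone notification. Don't
      describe stationary objects or buildings.
    # Hypothetical path to a snapshot saved earlier in the sequence.
    image_filename: /tmp/doorbell_snapshot.jpg
  # Store the service response so later steps can read its "text" field.
  response_variable: generated_content
# Hypothetical notifier that forwards the generated text to a phone.
- service: notify.mobile_app_phone
  data:
    message: "{{ generated_content.text }}"
```

Storing the response in a variable keeps the generation step separate from whatever consumes the text, so the second action could be swapped for any other service that accepts a templated message.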
Another example with multiple images:
Briefly describe what happened in the following sequence of images
from my driveway camera.
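The multi-image prompt above could be wrapped in a call along these lines. This is a sketch that assumes the image attribute (called `image_filename` here) accepts a list of paths. The service name, file paths, and response variable name are likewise hypothetical.

```yaml
# Sketch only: the service name, attribute names, and file paths are assumptions.
service: google_generative_ai_conversation.generate_content
data:
  prompt: >-
    Briefly describe what happened in the following sequence of images
    from my driveway camera.
  # Hypothetical snapshots, listed in the order they were taken.
  image_filename:
    - /tmp/driveway_snapshot1.jpg
    - /tmp/driveway_snapshot2.jpg
    - /tmp/driveway_snapshot3.jpg
response_variable: generated_content
```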