Skip to contents

What you need from Azure

foundryR talks to deployed Azure AI Foundry and Azure OpenAI resources. Before writing R code, create or identify:

  1. An Azure OpenAI resource or Azure AI Foundry project with an OpenAI endpoint.
  2. At least one chat or Responses API deployment, for example gpt-5-nano.
  3. An embedding deployment, for example text-embedding-3-small, if you plan to use embeddings.
  4. A Content Safety resource if you plan to use moderation, groundedness, or prompt-shield checks.
  5. Either API keys or a Microsoft Entra ID token.

In the Azure portal, open your Azure OpenAI resource, then use Keys and Endpoint to copy the endpoint URL and an API key. In Azure AI Foundry, use the deployments page to create model deployments and record their deployment names.

Deployment name vs base model name

The value you pass to model = is the deployment name you chose in Azure, not necessarily the base model name. If you deploy base model gpt-5-nano with deployment name my-gpt4, use model = "my-gpt4" in foundryR. The same rule applies to embedding deployments.

Install foundryR

install.packages("pak")
pak::pak("farach/foundryR")

Configure credentials

Set credentials for the current R session:

library(foundryR)

foundry_set_endpoint(Sys.getenv("AZURE_FOUNDRY_ENDPOINT"))
foundry_set_key("your-api-key")

For persistent local configuration, use store = TRUE:

foundry_set_endpoint(Sys.getenv("AZURE_FOUNDRY_ENDPOINT"), store = TRUE)
foundry_set_key("your-api-key", store = TRUE)

You can also edit .Renviron directly:

usethis::edit_r_environ()

Add values like these, then restart R:

AZURE_FOUNDRY_ENDPOINT=https://<resource-name>.openai.azure.com
AZURE_FOUNDRY_KEY=your-api-key
AZURE_FOUNDRY_MODEL=my-gpt4
AZURE_FOUNDRY_EMBED_MODEL=my-embedding-deployment

Keyless authentication with Microsoft Entra ID

API keys are convenient for local testing. For enterprise environments that already use service principals, managed identity, or Azure role-based access control, use a Microsoft Entra ID bearer token:

foundry_set_token("your-entra-token")

foundryR sends the token in the Authorization header. If both a token and an API key are configured, the token takes precedence for supported calls.

Validate setup

Test a specific deployment:

foundry_check_setup(model = "gpt-5-nano")

If you need to see deployments exposed by the v1 model metadata endpoint, use:

models <- foundry_models()
models[, c("id", "owned_by")]

First Responses API call

The Responses API is the newer v1 surface for stateful turns, strict structured outputs, tools, and richer token metadata. The examples below omit model =, so foundryR reads the deployment from AZURE_FOUNDRY_MODEL; pass model = to target a specific deployment.

library(foundryR)

response <- foundry_response("Answer in one sentence: what is R?")

response$output_text
#> [1] "R is a free, open-source programming language and environment for statistical computing and graphics, widely used for data analysis and visualization."

Chain a follow-up turn with previous_response_id:

follow_up <- foundry_response(
  "Explain why that matters for data analysis in one sentence.",
  previous_response_id = response$response_id
)

follow_up$output_text
#> [1] "Because R<U+2019>s free, open-source nature plus its extensive ecosystem of packages for data manipulation, statistics, modeling, and high-quality graphics enables powerful, reproducible data analysis and visualization without licensing constraints."

First strict extraction

Use JSON Schema when you need model output to become analyzable columns:

schema <- list(
  type = "object",
  properties = list(
    sentiment = list(type = "string", enum = c("positive", "negative", "neutral")),
    topic = list(type = "string")
  ),
  required = c("sentiment", "topic"),
  additionalProperties = FALSE
)

foundry_extract(
  c("The tutorial was clear.", "I needed more examples."),
  schema = schema
)
#> # A tibble: 2 × 10
#>   .input_idx .input_text     .response_id .status .output_text .error .error_msg
#>        <int> <chr>           <chr>        <chr>   <chr>        <lgl>  <chr>     
#> 1          1 The tutorial w… resp_0e928a… comple… "{\"sentime… FALSE  NA        
#> 2          2 I needed more … resp_01f233… comple… "{\"sentime… FALSE  NA        
#> # ℹ 3 more variables: raw_response <list>, sentiment <chr>, topic <chr>

foundry_extract() uses strict JSON Schema mode by default for supported models.

First embedding

Embeddings convert text to numeric vectors for clustering, semantic search, near-duplicate detection, and downstream models:

texts <- c(
  "The tutorial was clear.",
  "The lecture needed more examples.",
  "The assignment instructions were easy to follow."
)

embeddings <- foundry_embed(texts, model = "text-embedding-3-small")
foundry_similarity(embeddings)
#> # A tibble: 3 × 3
#>   text_1                            text_2                            similarity
#>   <chr>                             <chr>                                  <dbl>
#> 1 The tutorial was clear.           The assignment instructions were…      0.580
#> 2 The tutorial was clear.           The lecture needed more examples.      0.337
#> 3 The lecture needed more examples. The assignment instructions were…      0.290

Configure Content Safety

Content Safety uses a separate Azure AI Content Safety resource. In the Azure portal, create an Azure AI Content Safety resource, open Keys and Endpoint, then configure foundryR:

foundry_set_content_safety_endpoint(Sys.getenv("AZURE_CONTENT_SAFETY_ENDPOINT"))
foundry_set_content_safety_key("your-content-safety-key")

Use groundedness and shields as auditable safety gates:

source <- "The program enrolled 82 students in 2026."
answer <- "The program enrolled 82 students in 2026."

grounded <- foundry_groundedness(
  text = answer,
  grounding_sources = source,
  query = "How many students enrolled?",
  task = "QnA"
)

shield <- foundry_shield(user_prompt = "Summarize this document.")

grounded
#> # A tibble: 1 × 6
#>   grounded grounded_pct ungrounded_pct ungrounded_segments ungrounded_reasons
#>   <lgl>           <dbl>          <int> <list>              <list>            
#> 1 TRUE                1              0 <chr [0]>           <chr [0]>         
#> # ℹ 1 more variable: correction_text <chr>
shield
#> # A tibble: 1 × 3
#>   source      content                  attack_detected
#>   <chr>       <chr>                    <lgl>          
#> 1 user_prompt Summarize this document. FALSE

Most foundryR calls stay within your Azure OpenAI or Content Safety resources. Web search is different. Microsoft documents that Grounding with Bing can send data outside the compliance and geographic boundary and can incur separate costs. Do not send secrets or regulated data to web-search prompts.

Chat completions

Chat completions are still available for simple assistant replies:

foundry_chat("Answer in one sentence: what is the tidyverse?")
#> # A tibble: 1 × 9
#>   role      content          model finish_reason prompt_tokens completion_tokens
#>   <chr>     <chr>            <chr> <chr>                 <int>             <int>
#> 1 assistant The tidyverse i… gpt-… stop                     17               312
#> # ℹ 3 more variables: reasoning_tokens <int>, cached_input_tokens <int>,
#> #   total_tokens <int>

For interactive streaming chat and chat-first agent workflows, use ellmer.

Next steps