1. openai.AzureOpenAI: Deploy an LLM Model via Azure
1.1. Signature to Instantiate an AzureOpenAI Model
To instantiate an AzureOpenAI client we need:
```python
from dotenv import load_dotenv
from openai import AzureOpenAI
import os

load_dotenv(override=True)

client = AzureOpenAI(
    api_key=os.getenv("AZURE_API_KEY"),
    api_version=...,      # obtained after deploying a model (see below)
    azure_endpoint=...,   # obtained after deploying a model (see below)
)
```
Both api_version and azure_endpoint can be obtained after deploying a model successfully; let's see how to do that.
1.2. Steps to Obtain a Deployed Model and the Necessary Credentials
We go through the following steps:
- Go to the Azure cloud service portal.
- Select the Azure OpenAI service.
- Click on the newly created project, and then go to Overview.
- Click on Explore Azure AI Foundry Portal for a complete list of models.
- We get a list of available models.
- Choose a model and click Use this model.
- Now deploy the model (the charging scheme depends on the Deployment type; with the Standard type you pay nothing after deployment if you never use it).
- After deployment we are provided a target URI; its host is the azure_endpoint and its api-version query parameter is the api_version we need. From this we can fill in our client definition:
```python
import os
from openai import AzureOpenAI
from dotenv import load_dotenv

load_dotenv(override=True)

client = AzureOpenAI(
    api_key=os.getenv("AZURE_API_KEY"),
    api_version="2025-01-01-preview",
    azure_endpoint="https://shellscriptmanager.openai.azure.com",
)
```
- By the way, we can get the AZURE_API_KEY from the Home page (see the sketch after this list).
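For completeness, load_dotenv(override=True) expects a .env file next to the notebook containing a line like AZURE_API_KEY=...; a minimal sanity check (a sketch, not part of the original notes):

```python
# A quick check that the key from .env is actually visible to the process.
import os
from dotenv import load_dotenv

load_dotenv(override=True)
assert os.getenv("AZURE_API_KEY"), "AZURE_API_KEY not found - check your .env file"
```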
2. Chat Client
2.1. Simple Chat Demonstration
Let's test a simple conversation with our chat model (for AzureOpenAI, the model argument is the deployment name we created above):
```python
messages = [{"role": "user", "content": "What is 2+2?"}]

response = client.chat.completions.create(
    model="gpt-4.1-mini",
    messages=messages,
)
print(response.choices[0].message.content)
```
From this we get a short reply from the model confirming that 2 + 2 = 4.
2.2. System Prompt and User Prompt
For a chat client there are two kinds of prompt that control the behaviour of the model (see the example after this list):

- The system prompt carries the overall instructions that set the context for the task.
- The user prompt is the actual question coming from the user.
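Concretely, both prompts are just entries in the messages list with different roles; a minimal sketch reusing the client from section 1 (the tutor persona is only an illustration):

```python
# Minimal sketch: the system prompt sets the context, the user prompt carries the question.
messages = [
    {"role": "system", "content": "You are a concise and friendly math tutor."},
    {"role": "user", "content": "Why is 2+2 equal to 4?"},
]

response = client.chat.completions.create(model="gpt-4.1-mini", messages=messages)
print(response.choices[0].message.content)
```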
2.3. Example: Read my Resume (PDF Format) and Answer Questions about it
2.3.1. Preparation for Background Data
Let's load our resume (in PDF format) into text:
```python
from pypdf import PdfReader

reader = PdfReader("me/james_lee.pdf")
linkedin = ""
for page in reader.pages:
    text = page.extract_text()
    if text:
        linkedin += text

with open("me/summary.txt", "r", encoding="utf-8") as f:
    summary = f.read()

name = "James Lee"
```
This prepares the variables linkedin, summary and name.
2.3.2. Preparation for System Prompt
```python
system_prompt = f"""
You are acting as {name}. You are answering questions on {name}'s website,
particularly questions related to {name}'s career, background, skills and experience.
Your responsibility is to represent {name} for interactions on the website as faithfully as possible.
You are given a summary of {name}'s background and LinkedIn profile which you can use to answer questions.
Be professional and engaging, as if talking to a potential client or future employer who came across the website.
If you don't know the answer, say so.
"""

system_prompt += f"\n\n## Summary:\n{summary}\n\n## LinkedIn Profile:\n{linkedin}\n\n"
system_prompt += f"With this context, please chat with the user, always staying in character as {name}."
```
2.3.3. Inject the User's Chat Message as a User Prompt
```python
def chat_to_gpt4mini(messages):
    # note: the call must be returned, otherwise chat() receives None
    return client.chat.completions.create(
        model="gpt-4.1-mini",
        messages=messages,
    )

def chat(message, history):
    messages = [{"role": "system", "content": system_prompt}] \
        + history \
        + [{"role": "user", "content": message}]
    response = chat_to_gpt4mini(messages)
    return response.choices[0].message.content
```
2.3.4. Chat UI via gradio
```python
import gradio as gr

gr.ChatInterface(chat, type="messages").launch()
```
We get a chat interface rendered in the browser.
gradio instantiates an empty list history = [] for us and keeps it updated with the conversation so far, passing it back into chat on every new message.
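For illustration, with type="messages" the history after one exchange looks like a list of role/content dicts (the wording of the turns below is made up):

```python
# Hypothetical contents of `history` after one user/assistant exchange.
history = [
    {"role": "user", "content": "Where did you study?"},
    {"role": "assistant", "content": "I studied ... (answer drawn from the summary and LinkedIn text)."},
]
```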
2.3.5. Example: Asking about Details in my CV
2.4. Define Schema for Chat Client's Response via pydantic
We can define the schema of the LLM's response using response_format together with a pydantic model (the same kind of model we use extensively in FastAPI):
```python
from pydantic import BaseModel

class Evaluation(BaseModel):
    is_acceptable: bool
    feedback: str
```
Now we can constrain the response schema via:
```python
# note it is parse(), not create()
response = client.chat.completions.parse(
    model="gpt-4.1-mini",
    messages=messages,
    response_format=Evaluation,
)
result = response.choices[0].message.parsed
```
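result is then an Evaluation instance rather than a raw string, so its fields are typed; for example:

```python
# `result` is a typed Evaluation object, so we can access its fields directly.
if result.is_acceptable:
    print("Reply accepted")
else:
    print("Feedback:", result.feedback)
```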
2.5. System Prompt can be Dynamic
Note that the system prompt can be adjusted according to the incoming message before we send the request to the LLM:
```python
def chat(message, history):
    if "patent" in message:
        system = system_prompt + "\n\nEverything in your reply needs to be in pig latin - " \
            "it is mandatory that you respond only and entirely in pig latin"
    else:
        system = system_prompt

    # use the (possibly) updated system prompt
    messages = [{"role": "system", "content": system}] \
        + history \
        + [{"role": "user", "content": message}]

    # `openai` is a plain OpenAI client created elsewhere in the course notebook;
    # with our Azure setup this would simply be `client`
    response = openai.chat.completions.create(model="gpt-4o-mini", messages=messages)
    reply = response.choices[0].message.content

    # we evaluate the response via another model
    evaluation = evaluate(reply, message, history)

    if evaluation.is_acceptable:
        print("Passed evaluation - returning reply")
    else:
        print("Failed evaluation - retrying")
        print(evaluation.feedback)
        reply = rerun(reply, message, history, evaluation.feedback)
    return reply
```
We are free to prepend any adjusted system prompt to messages before injecting them into our chat client.
When we need to rerun the flow, we adjust the system prompt again in rerun:
```python
def evaluate(reply, message, history) -> Evaluation:
    # system prompt: provides the rules for the evaluation
    # user prompt: provides the data to evaluate and asks the evaluator model for a verdict
    messages = [{"role": "system", "content": evaluator_system_prompt}] \
        + [{"role": "user", "content": evaluator_user_prompt(reply, message, history)}]
    # `gemini` is a second OpenAI-compatible client pointed at Gemini, created elsewhere
    response = gemini.beta.chat.completions.parse(
        model="gemini-2.0-flash",
        messages=messages,
        response_format=Evaluation,
    )
    return response.choices[0].message.parsed


def rerun(reply, message, history, feedback):
    updated_system_prompt = system_prompt + "\n\n## Previous answer rejected\nYou just tried to reply, but the quality control rejected your reply\n"
    updated_system_prompt += f"## Your attempted answer:\n{reply}\n\n"
    updated_system_prompt += f"## Reason for rejection:\n{feedback}\n\n"
    messages = [{"role": "system", "content": updated_system_prompt}] \
        + history \
        + [{"role": "user", "content": message}]
    response = openai.chat.completions.create(model="gpt-4o-mini", messages=messages)
    return response.choices[0].message.content
```
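The snippet above assumes evaluator_system_prompt and evaluator_user_prompt are defined elsewhere in the notebook; a minimal sketch of what they could look like (the wording is my own, not the course's):

```python
# Hypothetical evaluator prompts; the course defines its own wording elsewhere.
evaluator_system_prompt = (
    f"You are an evaluator that decides whether a response to a question is acceptable. "
    f"The agent is playing the role of {name} on {name}'s website and must stay in character, "
    f"be professional, and only use the provided background. "
    f"Reply with whether the response is acceptable and with your feedback."
)

def evaluator_user_prompt(reply, message, history):
    prompt = f"Here is the conversation so far:\n\n{history}\n\n"
    prompt += f"Here is the latest user message:\n\n{message}\n\n"
    prompt += f"Here is the agent's reply:\n\n{reply}\n\n"
    prompt += "Please evaluate the reply, stating whether it is acceptable and giving feedback."
    return prompt
```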
3. Agents and Tools
This part has now been replaced by MCP, but we keep a record here to understand what's happening under the hood.
The purpose of this section is to show how complex it is to bring tools into an application by hand, and thus why we need MCP as an abstraction.
3.1. Define Tools
Usually we define tools as ordinary functions:
```python
def record_user_details(email, name="Name not provided", notes="not provided"):
    push(f"Recording interest from {name} with email {email} and notes {notes}")
    return {"recorded": "ok"}

def record_unknown_question(question):
    push(f"Recording {question} asked that I couldn't answer")
    return {"recorded": "ok"}
```
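Both tools rely on a push helper that isn't shown here; in the course it sends a push notification to the owner (e.g. via Pushover). A minimal stand-in for local testing:

```python
# Hypothetical stand-in for push(); the original sends a push notification instead.
def push(text):
    print(f"[push] {text}")
```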
Next we define the metadata for the tools so that our chat client can pick the right one(s) based on their descriptions and the user's message:
```python
record_user_details_json = {
    "name": "record_user_details",
    "description": "Use this tool to record that a user is interested in being in touch and provided an email address",
    "parameters": {
        "type": "object",
        "properties": {
            "email": {
                "type": "string",
                "description": "The email address of this user"
            },
            "name": {
                "type": "string",
                "description": "The user's name, if they provided it"
            },
            "notes": {
                "type": "string",
                "description": "Any additional information about the conversation that's worth recording to give context"
            }
        },
        "required": ["email"],
        "additionalProperties": False
    }
}

record_unknown_question_json = {
    "name": "record_unknown_question",
    "description": "Always use this tool to record any question that couldn't be answered as you didn't know the answer",
    "parameters": {
        "type": "object",
        "properties": {
            "question": {
                "type": "string",
                "description": "The question that couldn't be answered"
            },
        },
        "required": ["question"],
        "additionalProperties": False
    }
}
```
We combine the definitions to get:
```python
tools = [
    {"type": "function", "function": record_user_details_json},
    {"type": "function", "function": record_unknown_question_json},
]
```
3.2. Apply the Tools
Finally we apply the tools via the code below; be sure to understand the tool-call loop, as we plug it into the same gradio chat UI as before:
```python
import json

def handle_tool_calls(tool_calls):
    results = []
    for tool_call in tool_calls:
        tool_name = tool_call.function.name
        arguments = json.loads(tool_call.function.arguments)
        print(f"Tool called: {tool_name}", flush=True)
        # look up the Python function with the same name as the requested tool
        tool = globals().get(tool_name)
        result = tool(**arguments) if tool else {}
        results.append({"role": "tool", "content": json.dumps(result), "tool_call_id": tool_call.id})
    return results

def chat(message, history):
    messages = [{"role": "system", "content": system_prompt}] + history + [{"role": "user", "content": message}]
    done = False
    while not done:
        # This is the call to the LLM - see that we pass in the tools json
        response = client.chat.completions.create(
            model="gpt-4.1-mini",
            messages=messages,
            tools=tools,
        )
        finish_reason = response.choices[0].finish_reason

        # If the LLM wants to call a tool, we do that!
        if finish_reason == "tool_calls":
            message = response.choices[0].message
            tool_calls = message.tool_calls
            results = handle_tool_calls(tool_calls)
            messages.append(message)
            messages.extend(results)
        else:
            done = True
    return response.choices[0].message.content
```
Bringing tools into a chat client this way is tedious, complex and hard to maintain; the advent of MCP now simplifies the tooling process.
4. Reference
- Ed Donner, AI Engineer Agentic Track: The Complete Agent & MCP Course, Udemy