Python: Prompt injection in OpenAI clients #21141
base: main
Conversation
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
- Add testcase and coverage for agents sdk runner run with input param
- Rename agent sdk module for clarity
- Add case for unnamed param use in runner run from agent sdk
QHelp previews: python/ql/src/experimental/Security/CWE-1427/PromptInjection.qhelp

Prompt injection

Prompts can be constructed to bypass the original purposes of an agent and lead to sensitive data leak or operations that were not intended.

Recommendation

Sanitize user input and also avoid using user input in developer or system level prompts.

Example

In the following examples, the cases marked GOOD show secure prompt construction; whereas in the case marked BAD they may be susceptible to prompt injection.

from pathlib import Path

from flask import Flask, request
from agents import Agent
from guardrails import GuardrailAgent

app = Flask(__name__)

@app.route("/parameter-route")
def get_input():
    input = request.args.get("input")
    goodAgent = GuardrailAgent(  # GOOD: Agent created with guardrails automatically configured.
        config=Path("guardrails_config.json"),
        name="Assistant",
        instructions="This prompt is customized for " + input)
    badAgent = Agent(
        name="Assistant",
        instructions="This prompt is customized for " + input  # BAD: user input in agent instruction.
    )

References
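As a complement to the qhelp example, here is a minimal sketch of the "sanitize user input" recommendation. The route, regex, and length cap are illustrative assumptions, not code from the PR, and whether the query recognizes a given check as a sanitizer depends on its customizations module.

import re

from flask import Flask, request
from agents import Agent

app = Flask(__name__)

@app.route("/sanitized-route")
def get_sanitized_input():
    raw = request.args.get("input", "")
    # Illustrative allowlist: restrict the character set and length so the
    # value cannot smuggle new instructions into the agent prompt.
    if not re.fullmatch(r"[A-Za-z0-9 ,.'-]{1,100}", raw):
        return "Invalid input", 400
    agent = Agent(
        name="Assistant",
        instructions="This prompt is customized for " + raw,
    )
    return "Configured agent " + agent.name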
Pull request overview
This pull request introduces a new experimental CodeQL query to detect prompt injection vulnerabilities in Python code that uses AI/LLM APIs, specifically targeting the openai and agents libraries. The implementation adds security analysis capabilities for identifying where user-controlled input flows into AI prompts without proper sanitization.
Key changes:
- New experimental query `py/prompt-injection` with dataflow configuration to track tainted data from remote sources to AI prompt sinks (see the sketch below)
- Framework models for OpenAI and the agents SDK to identify prompt construction patterns
- New `AIPrompt` concept in the core library to model AI prompting operations
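For illustration, the source-to-sink flow this query targets looks roughly like the following. This is a sketch assuming the openai v1 Python client; the route, model name, and prompt text are invented.

from flask import Flask, request
from openai import OpenAI

app = Flask(__name__)
client = OpenAI()

@app.route("/unsafe")
def unsafe():
    user_input = request.args.get("input", "")  # remote flow source
    completion = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            # Untrusted input reaches a system-level prompt, so a crafted
            # request can override the agent's original instructions.
            {"role": "system", "content": "Site policy: " + user_input},
        ],
    )
    return completion.choices[0].message.content or ""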
Reviewed changes
Copilot reviewed 15 out of 15 changed files in this pull request and generated 10 comments.
| File | Description |
|---|---|
| python/ql/src/experimental/Security/CWE-1427/PromptInjection.ql | Main query definition using taint tracking to detect prompt injection |
| python/ql/src/experimental/Security/CWE-1427/PromptInjection.qhelp | Documentation and examples for the query |
| python/ql/src/experimental/Security/CWE-1427/examples/example.py | Example code demonstrating good and bad practices |
| python/ql/lib/semmle/python/security/dataflow/PromptInjectionQuery.qll | Dataflow configuration for prompt injection detection |
| python/ql/lib/semmle/python/security/dataflow/PromptInjectionCustomizations.qll | Sources, sinks, and sanitizers for prompt injection |
| python/ql/lib/semmle/python/frameworks/OpenAI.qll | Models for OpenAI and agents SDK APIs |
| python/ql/lib/semmle/python/frameworks/openai.model.yml | MaD model for OpenAI sink and type definitions |
| python/ql/lib/semmle/python/frameworks/agent.model.yml | MaD model for agents SDK sink definitions |
| python/ql/lib/semmle/python/Frameworks.qll | Integration of OpenAI framework into main frameworks module |
| python/ql/lib/semmle/python/Concepts.qll | New AIPrompt concept for modeling AI prompting operations |
| python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/openai_test.py | Test cases for OpenAI prompt injection patterns |
| python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/agent_instructions.py | Test cases for agents SDK prompt injection patterns |
| python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/PromptInjection.qlref | Test query reference configuration |
| python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/PromptInjection.expected | Expected test results |
| python/ql/lib/change-notes/2026-01-02-prompt-injection.md | Release notes for the new feature |
Comments suppressed due to low confidence (11)
python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/agent_instructions.py:35
- There is inconsistent spacing in the inline test annotation. The annotation should be `# $Alert[py/prompt-injection]` with a space after `#`, consistent with the annotations on other lines in the file.
"content": input, # $Alert[py/prompt-injection]
python/ql/lib/semmle/python/frameworks/OpenAI.qll:4
- The comment text "openAI" should use consistent capitalization. The official product name is "OpenAI" (capital O and capital AI).
* As well as the regular openai python interface.
python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/agent_instructions.py:25
- There is inconsistent spacing in the inline test annotation. The annotation should be `# $Alert[py/prompt-injection]` with a space after `#`, consistent with the annotations on other lines in the file.
"content": input, # $Alert[py/prompt-injection]
python/ql/src/experimental/Security/CWE-1427/PromptInjection.qhelp:16
- The phrase "the case marked BAD they may be" is missing a conjunction. It should be "the case marked BAD, they may be" (with comma) or "the cases marked BAD are" for better grammatical flow.
<p>In the following examples, the cases marked GOOD show secure prompt construction; whereas in the case marked BAD they may be susceptible to prompt injection.</p>
python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/agent_instructions.py:30
- Variable result2 is not used.
result2 = Runner.run_sync(
python/ql/src/experimental/Security/CWE-1427/examples/example.py:14
- Variable badAgent is not used.
badAgent = Agent(
python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/openai_test.py:21
- Variable response2 is not used.
response2 = client.responses.create(
python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/openai_test.py:40
- Variable response3 is not used.
response3 = await async_client.responses.create(
python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/openai_test.py:59
- Variable completion1 is not used.
completion1 = client.chat.completions.create(
python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/openai_test.py:76
- Variable completion2 is not used.
completion2 = azure_client.chat.completions.create(
python/ql/test/experimental/query-tests/Security/CWE-1427-PromptInjection/openai_test.py:89
- Variable assistant is not used.
assistant = client.beta.assistants.create(
…ptInjection/openai_test.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
yoff left a comment
Sorry for the long response time.
This looks generally quite good. It fits our current structure, including the tests.
There are a few things I would change:
- If it is an experimental query, then its associated models and concepts should also be in the experimental directory. There should be appropriate places for everything, but shout if it looks like there aren't, and I can help.
- #21134 is now merged, so you can probably move a lot of the modeling into models-as-data. (If you do, you can probably ignore the code comments.)
- It might be nice to clean up the commit history 😅
/** Gets a reference to the `openai.OpenAI` class. */
API::Node classRef() {
  result =
    API::moduleImport("openai").getMember(["OpenAI", "AsyncOpenAI", "AzureOpenAI"]).getReturn()
}
It looks like you already have these in MaD, can you not just reuse those?
`classRef` is used in `getContentNode` so that we can use some logic to mark only the innermost element in the object structure as a sink.
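To illustrate that last point with an invented Python call (not code from the PR): given a nested messages argument, the intent is that only the innermost string value is flagged, not the enclosing list or dict.

from openai import OpenAI

client = OpenAI()
user_input = "..."  # stand-in for a value from a remote source

client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[                      # not a sink: the outer list
        {                           # not a sink: the message dict
            "role": "user",
            "content": user_input,  # sink: the innermost content value
        }
    ],
)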
This pull request introduces a new CodeQL query for detecting prompt injection vulnerabilities in Python code targeting AI prompting APIs such as `agents` and `openai`. The changes include a new experimental query, new taint flow and type models, a customizable dataflow configuration, documentation, and comprehensive test coverage.