Implement SLF4Jm JUL, and Log4J2 providers for Java SDK by uranusjr · Pull Request #68725 · apache/airflow

uranusjr · 2026-06-18T17:53:54Z

Additional mechanism is added Logger in sdk.execution to support sending arbitrary log messages to LogSender. Once that's in place, we can implement various adapters to support common Java logging providers with a bit boilerplate code for each.

Those providers are implemented in separate artifacts since nobody needs all of them at once. The user can choose what they want. Documentation is added to describe how to do it.

I changed examples to prefer the Java Platform Logging interface instead of SLF4J too. This is built-in since Java 9. All the logging tools are more or less the same, so users should adapt pretty easily.

A new airflow-sdk-slf4j artifact is added to allow SLF4J to be seamlessly forwarded to Airflow's task log infrastructure. Use 'org.apache.airflow:airflow-sdk-slf4j' to enable this.

This allows log4j users to write logs into Airflow directly. Note that this uses Java since the appender is registered with an annotation, and the annotation processor can only handle Java. (There's a Kotlin bridge, but the class is small enough it doesn't make much sense to pull it in.)

This is too fancy for the spell checker. Use Dumb English instead.

Systen.log annoyingly pass a null as an array to a vararg method implemented in Kotlin. This breaks Kotlin since it expects an empty array instead. Fortunately, there seems to be a way to work around this, according to Claude? Let's hope this works.

uranusjr · 2026-06-18T21:48:59Z

Don’t know if it makes sense… since Airflow uses structlog, and the log forwarder actually works best with structured arguments, trying out intentionally NOT rendering the message but put the arguments in the dict instead

Is this too weird?

phanikumv · 2026-06-19T05:11:42Z

+      s.split(Regex("""[\s,]+""")).forEach {
+        val parts = it.split(Regex("""\s*=\s*"""), 2)
+        val level = parse(parts[1])
+        if (level != null) put(parts[0], level)


Suggested change

s.split(Regex("""[\s,]+""")).forEach {

val parts = it.split(Regex("""\s*=\s*"""), 2)

val level = parse(parts[1])

if (level != null) put(parts[0], level)

s.split(Regex("""[\s,]+""")).forEach {

if (it.isEmpty()) return@forEach

val parts = it.split(Regex("""\s*=\s*"""), 2)

if (parts.size != 2) return@forEach // <- guards parts[1]

val level = parse(parts[1])

if (level != null) put(parts[0], level)

}

The happy path works fine, however,

Say a deployment sets:

AIRFLOW__LOGGING__NAMESPACE_LEVELS="botocore=debug,"

A trailing comma — extremely common when people build up a comma list. Now trace it:

Step 1 — split "botocore=debug," on [\s,]+:

["botocore=debug", ""]

The trailing comma produces an empty-string token at the end.

Step 2 — the loop reaches the "" token and splits it on =:

"".split(Regex("""\s*=\s*"""), 2) -> [""] // size 1, no "=" to split on

Step 3 — parse(parts[1]) reads index 1 of a 1-element list:

java.lang.IndexOutOfBoundsException: Index 1 out of bounds for length 1

The same thing happens for "botocore" (a bare name, no =), " botocore=debug" (leading space), ",botocore=debug" (leading comma), or the env var set to "".

The example value in config.yml for this exact option is:

example: "sqlalchemy=INFO sqlalchemy.engine=DEBUG, botocor"

That last token, botocor, has no =level — so a user who copies the documented example verbatim hits the crash

The Python parser (structlog.py) has the same parsing gap, but it fails loudly with a ValueError at config time naming the bad value

That example is broken and got truncated in my PR that I added. It should fail with an error, but a nice one.

I’ll add some nicer errors.

The Python implementation currently also just use regex without error handling. Should we improve it too?

airflow/shared/logging/src/airflow_shared/logging/structlog.py

Lines 560 to 564 in d74fbff

if isinstance(namespace_log_levels, str):

log_from_level = partial(re.compile(r"\s*=\s*").split, maxsplit=2)

namespace_log_levels = {

log: level for log, level in map(log_from_level, re.split(r"[\s,]+", namespace_log_levels))

}

yes we should, but may be in a separate PR

uranusjr requested review from amoghrajesh, ashb, bugraoz93, gopidesupavan, jason810496, jscheffl and potiuk as code owners June 18, 2026 17:53

boring-cyborg Bot added the kind:documentation label Jun 18, 2026

uranusjr mentioned this pull request Jun 18, 2026

Fix Java-SDK logging level #68696

Closed

1 task

uranusjr added the AIP-108: java-sdk Change this to an 'area:' label after AIP acceptance. label Jun 18, 2026

uranusjr added this to the Java SDK 1.0 milestone Jun 18, 2026

uranusjr force-pushed the java-sdk-logging-providers branch from d73ec53 to 7c0f104 Compare June 18, 2026 18:01

uranusjr added 11 commits June 19, 2026 04:58

Implement SLF4J provider for SDK

7450734

A new airflow-sdk-slf4j artifact is added to allow SLF4J to be seamlessly forwarded to Airflow's task log infrastructure. Use 'org.apache.airflow:airflow-sdk-slf4j' to enable this.

Support java.util.logging with custom handler

2d9565c

Add documentation on logging in the Java SDK

9356f21

SDK tweaks for testing

d6bc640

Add tests for log providers

2386de8

Add E2E tests for logging

2118e5b

Add JPL provider and prefer it in all examples

4301b64

Implement environ-backed level filtering

d5f1fa1

Fix RAT checks

3c7a6ae

Fix 'façade' is doc

1940ee6

This is too fancy for the spell checker. Use Dumb English instead.

uranusjr force-pushed the java-sdk-logging-providers branch from f35217b to 3ac5294 Compare June 18, 2026 21:00

uranusjr added 2 commits June 19, 2026 05:05

Ensure log channel is open until task finishes

d48fe00

uranusjr force-pushed the java-sdk-logging-providers branch from 3ac5294 to d48fe00 Compare June 18, 2026 21:05

phanikumv reviewed Jun 19, 2026

View reviewed changes

Comment thread airflow-core/docs/authoring-and-scheduling/language-sdks/java.rst

phanikumv reviewed Jun 19, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement SLF4Jm JUL, and Log4J2 providers for Java SDK#68725

Implement SLF4Jm JUL, and Log4J2 providers for Java SDK#68725
uranusjr wants to merge 13 commits into
apache:mainfrom
astronomer:java-sdk-logging-providers

uranusjr commented Jun 18, 2026

Uh oh!

uranusjr commented Jun 18, 2026

Uh oh!

Uh oh!

phanikumv Jun 19, 2026

Uh oh!

phanikumv Jun 19, 2026

Uh oh!

ashb Jun 19, 2026

Uh oh!

uranusjr Jun 19, 2026

Uh oh!

phanikumv Jun 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	if isinstance(namespace_log_levels, str):
	log_from_level = partial(re.compile(r"\s=\s").split, maxsplit=2)
	namespace_log_levels = {
	log: level for log, level in map(log_from_level, re.split(r"[\s,]+", namespace_log_levels))
	}

Conversation

uranusjr commented Jun 18, 2026

Uh oh!

uranusjr commented Jun 18, 2026

Uh oh!

Uh oh!

phanikumv Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

phanikumv Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

ashb Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

uranusjr Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

phanikumv Jun 19, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants