Skip to content

Remote partitioning with aggregate replies, the partition manager didn't notify the completion correctly if the manager app got restarted #4011

@sereysopheak

Description

@sereysopheak

Remote partitioning with aggregate replies, the partition manager didn't notify the job completion correctly if the manager app got restarted. I'm trying to configure it to use Kafka as the messaging channel. Everything is working fine if the manager keeps running while worker processing the job. At one point if manager instance is shut down and restarted, I can see the worker is still processing the job normally and replies are sent from the worker, but the manager itself didn't notify the job completion correctly even though the replies messages were consumed. And the status is stuck in STARTED with exit code UNKNOWN

Environment
Spring Batch version: 4.2.7
Java 11
database: MySQL

Steps to reproduce

  • Running the manager and worker on different instance
  • While worker is executing the job, restart the manager
  • Manager never update the status to complete/failed and the after job listener never got triggered.

Expected behavior
As it's done using the replies through messaging event, manager should be able to consume the replies message and reconcile the job status.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions