Hello,
I have a rather urgent issue with a hapi deployment. We need to execute the /$reindex operation and it was working fine on all test systems. The largest test system was 10GB in size and it took less than 20 minutes there, but on the production system it seems to be stuck for 6 days now.
It is a container deployment, based on the hapi-fhir starter project.
The DB size is roughly 100GB, there are the following resources (same resource types as in the test system):
- Appointment: 2M
- CodeSystem: 27 (but they are really huge)
- Encounter: 3.6M
- Immunization: 2.2M
- ImmunizationRecommendation: 300k
- Location: 400
- Observation: 800
- Organization: 400
- Patient: 360k
- Practitioner: 3k
- SearchParameter: 1
- StructureDefinition: 68
- Subscription: 20
We can see in the logs that the task is not completely idle:
2025-08-07 06:55:55.171 [hapi-fhir-jpa-scheduler-clustered-2] INFO c.u.fhir.log.batch_troubleshooting Job 51a7e3d6-f560-4369-8647-595b19ad005d of type REINDEX has status IN_PROGRESS - 0 records processed
Is it possible that the task is really not done yet? Is there something we can do to stop the task? Can we do something to make it more performant?
Thanks a lot,
cheers,
Johanna