
Conversation


@jloux-brapi commented Mar 17, 2025

Description

BI-2578

  • Updated the BrAPIGermplasmDAO and the BrAPIDAOUtil to fetch all existing germplasm records for a program at once, without pagination. This lets the BrAPI server handle the request in a single call and resolves BI-2489 for germplasm cache fetches. The behavior is configurable via the new CACHE_PAGINATE_GERMPLASM env var, added to the template and application.yml. If memory gets exhausted for a particular program's germplasm records, this var should be set to true.
  • Updated the BrAPIDAOUtil so that, for all other entities, it fetches the cache 65,000 records at a time to avoid other SQL errors that could arise from BI-2489 (a minimal sketch of this chunked fetch follows this list). If this turns out to be slow for these entities, we can increase this amount, but a better solution IMO would be to get rid of the cache entirely for all entities and hit the BrAPI test server with smaller pages, so users only hit the test server when they need it (and hit it much less hard), with less data being transmitted. The maximum number of records per page fetch for the program cache is now configurable via the new CACHE_BRAPI_FETCH_PAGE_SIZE env var. Its value should match the paging.page-size.max-allowed application property for the test server; if it exceeds the test server's value, all requests will result in a 400.
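
For reviewers who want a feel for the two fetch paths without digging into BrAPIDAOUtil, here is a minimal, self-contained Java sketch of the approach described above. The class and method names (ChunkedFetchSketch, fetchAll, fetchGermplasm, the PageFetcher-style BiFunction) are hypothetical illustrations, not the actual bi-api code; the defaults simply mirror the values mentioned in this PR (a configurable page size such as 65,000, and an optional single-request germplasm fetch).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BiFunction;

/**
 * Hypothetical sketch of the cache-fetch strategy described in this PR.
 * Not the actual BrAPIDAOUtil code; names and defaults are illustrative only.
 */
public class ChunkedFetchSketch {

    // Mirrors CACHE_BRAPI_FETCH_PAGE_SIZE; this should match the BrAPI server's
    // paging.page-size.max-allowed, otherwise page requests come back as 400s.
    private final int fetchPageSize;

    // Mirrors CACHE_PAGINATE_GERMPLASM; when false, germplasm is requested
    // in a single unpaginated call per program.
    private final boolean paginateGermplasm;

    public ChunkedFetchSketch(int fetchPageSize, boolean paginateGermplasm) {
        this.fetchPageSize = fetchPageSize;
        this.paginateGermplasm = paginateGermplasm;
    }

    /**
     * Fetch all records for a non-germplasm entity, one page at a time.
     * The fetcher takes (page, pageSize) and returns one page of results;
     * a short or empty page signals the end of the data.
     */
    public <T> List<T> fetchAll(BiFunction<Integer, Integer, List<T>> fetchPage) {
        List<T> all = new ArrayList<>();
        int page = 0;
        while (true) {
            List<T> batch = fetchPage.apply(page, fetchPageSize);
            all.addAll(batch);
            if (batch.size() < fetchPageSize) {
                break; // last (possibly partial) page reached
            }
            page++;
        }
        return all;
    }

    /**
     * Germplasm: by default fetch everything for the program in one request;
     * fall back to chunked paging if CACHE_PAGINATE_GERMPLASM is enabled
     * because memory is a concern.
     */
    public <T> List<T> fetchGermplasm(BiFunction<Integer, Integer, List<T>> fetchPage) {
        if (paginateGermplasm) {
            return fetchAll(fetchPage);
        }
        // Single unpaginated request; Integer.MAX_VALUE stands in for
        // "no pagination" and is purely illustrative here.
        return fetchPage.apply(0, Integer.MAX_VALUE);
    }
}
```

The key point of the sketch is the trade-off: the non-germplasm path never asks the server for more than fetchPageSize records per request, while the germplasm path trades memory for a single round trip per program.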

Dependencies

This code is tied to this MR on the BrAPI Prod server. Once that code is merged, this code can be merged and the feature can be tested end to end.

I've also added new configurable variables, and created an MR for the docker stack with those same variables.

Testing

With a substantial database of germplasm records (more than 65k), start the application to load the cache. The cache should load without failing, thanks to the fix for BI-2489, and all germplasm data will be retrieved at once per program. There is a per-program limit of 250k records; if clients approach that number, it will be time to move to a cacheless, request-based implementation of the germplasm fetch. (At that point, I would recommend a cacheless implementation for all entities.)

There is other testing to be done for BI-2578, but that is more on the prod server side than bi-api, so I'll leave it to you to look at and test there.


@mlm483 left a comment


I tested locally with the BJTS changes and it works well; I was able to fetch 68k germplasm in 10 seconds after flushing the cache.

@nickpalladino merged commit 5159c9d into develop on Apr 16, 2025
1 check passed
@nickpalladino deleted the feature/BI-2578 branch on April 16, 2025