SOLR-13350: multi-threaded search: replace cached with fixed threadpool #2508

cpoerschke · 2024-06-07T16:40:46Z

https://issues.apache.org/jira/browse/SOLR-13350

cpoerschke · 2024-06-07T16:44:46Z

solr/core/src/java/org/apache/solr/core/CoreContainer.java

    this.collectorExecutor =
-        ExecutorUtil.newMDCAwareCachedThreadPool(
+        ExecutorUtil.newMDCAwareFixedThreadPool(
            cfg.getIndexSearcherExecutorThreads(), // thread count
            cfg.getIndexSearcherExecutorThreads() * 1000, // queue size
            new SolrNamedThreadFactory("searcherCollector"));


The ThreadPoolExecutor documentation (https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/util/concurrent/ThreadPoolExecutor.html) says "If corePoolSize or more threads are running, the Executor always prefers queuing a request rather than adding a new thread." and "If a request cannot be queued, a new thread is created unless this would exceed maximumPoolSize, in which case, the task will be rejected."

I think this means that if the queue size is generous (relative to the use of the thread pool) then queuing will always be possible, the pool will never grow beyond its core size, and search will effectively remain single-threaded.
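To illustrate the quoted behaviour, here is a minimal, self-contained sketch using plain `java.util.concurrent` rather than Solr's `ExecutorUtil`. It assumes (this is not confirmed anywhere in this thread) that the cached pool behaves like a `ThreadPoolExecutor` with a `corePoolSize` of 0 and a bounded queue, and that the fixed pool behaves like one whose `corePoolSize` equals its `maximumPoolSize`. The class and method names are hypothetical.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

public class PoolGrowthDemo {

  /** Submits blocking tasks and reports the largest number of worker threads the pool started. */
  public static int largestPoolSize(int corePoolSize, int maxPoolSize, int queueSize, int tasks)
      throws InterruptedException {
    ThreadPoolExecutor pool =
        new ThreadPoolExecutor(
            corePoolSize, maxPoolSize, 60L, TimeUnit.SECONDS, new LinkedBlockingQueue<>(queueSize));
    // Every task blocks on this latch, so no worker becomes free while we submit.
    CountDownLatch release = new CountDownLatch(1);
    try {
      for (int i = 0; i < tasks; i++) {
        pool.execute(
            () -> {
              try {
                release.await();
              } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
              }
            });
      }
      return pool.getLargestPoolSize();
    } finally {
      release.countDown(); // let all tasks finish
      pool.shutdown();
      pool.awaitTermination(10, TimeUnit.SECONDS);
    }
  }

  public static void main(String[] args) throws InterruptedException {
    int threads = 4; // assumed thread count, mirrors the old default discussed in this PR
    int queueSize = threads * 1000; // same ratio as in the diff above

    // Assumed "cached" shape: core size 0, so requests are queued rather than given new threads.
    System.out.println("core=0, max=4:  " + largestPoolSize(0, threads, queueSize, 16)); // 1

    // Assumed "fixed" shape: core size == max size, so threads are added before queuing.
    System.out.println("core=4, max=4:  " + largestPoolSize(threads, threads, queueSize, 16)); // 4
  }
}
```

Under those assumptions the first configuration never starts more than one worker while the queue has room, whereas the second starts workers up to the configured thread count before it begins queuing.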

dsmiley · 2024-06-18T13:33:55Z

Can you recommend a way to benchmark this myself, like if I wanted to tweak it to see how my ideas work out?

cpoerschke · 2024-06-18T17:31:15Z

> Can you recommend a way to benchmark this myself, like if I wanted to tweak it to see how my ideas work out?

I've shared the "just pass an executor" patch that I applied to the Solr 9.5 cloud I used, in case that's useful.

The benchmarking experiments I ran used Python's threading module and (I think, under the hood) pysolr, measuring for example QTime, elapsed time and CPU utilisation. My experiments were for dense vector search; I haven't yet looked into if/how other searches might use the passed-in executor at the Lucene level. It's worth mentioning at this point that the collection was split into quite a few shards, which is independent of the type of search being run.
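For anyone who would rather stay on the JVM, the same idea (several concurrent clients, each timing its own requests) can be sketched in Java. This is not the Python script used for the experiments described here; `QueryLoadSketch`, `run` and the stand-in query are hypothetical, and a real run would replace the stand-in with an actual Solr request and also record QTime and CPU utilisation.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class QueryLoadSketch {

  /** Runs the query requestsPerClient times on each of `clients` threads; returns elapsed ms per call. */
  public static List<Long> run(int clients, int requestsPerClient, Callable<?> query)
      throws Exception {
    ExecutorService clientPool = Executors.newFixedThreadPool(clients);
    try {
      // One simulated client: issues its requests back to back and times each one.
      Callable<List<Long>> client =
          () -> {
            List<Long> elapsed = new ArrayList<>();
            for (int r = 0; r < requestsPerClient; r++) {
              long start = System.nanoTime();
              query.call();
              elapsed.add((System.nanoTime() - start) / 1_000_000);
            }
            return elapsed;
          };

      List<Future<List<Long>>> futures = new ArrayList<>();
      for (int c = 0; c < clients; c++) {
        futures.add(clientPool.submit(client));
      }

      List<Long> all = new ArrayList<>();
      for (Future<List<Long>> f : futures) {
        all.addAll(f.get()); // rethrows if any client failed
      }
      return all;
    } finally {
      clientPool.shutdown();
    }
  }

  public static void main(String[] args) throws Exception {
    // Stand-in for a real search request (assumption: replace with an HTTP call to Solr).
    Callable<Void> standIn =
        () -> {
          Thread.sleep(5);
          return null;
        };
    List<Long> times = run(4, 10, standIn);
    double mean = times.stream().mapToLong(Long::longValue).average().orElse(0);
    System.out.println(times.size() + " requests, mean elapsed ms: " + mean);
  }
}
```

Comparing the mean (or a percentile) across different `clients` values and different executor settings on the Solr side would give a rough picture of how a tweak behaves under load.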

gus-asf · 2024-06-26T03:28:08Z

Along with this maybe we want to change

public static final int DEFAULT_INDEX_SEARCHER_EXECUTOR_THREADS = 4;

to

public static final int DEFAULT_INDEX_SEARCHER_EXECUTOR_THREADS = Runtime.getRuntime().availableProcessors();

cpoerschke · 2024-07-18T17:17:08Z

> Along with this maybe we want to change
>
> public static final int DEFAULT_INDEX_SEARCHER_EXECUTOR_THREADS = 4;
>
> to
>
> public static final int DEFAULT_INDEX_SEARCHER_EXECUTOR_THREADS = Runtime.getRuntime().availableProcessors();

Opened #2569 to separate that less controversial (?) change out from the less obvious (?) change here. There are also some associated doc and config changes needed to go with making the default runtime-aware.


SOLR-13350: multi-threaded search: replace cached with fixed threadpool

23e3876

github-actions bot added the client:solrj label Jun 7, 2024


cpoerschke marked this pull request as ready for review June 7, 2024 16:58

cpoerschke mentioned this pull request Jul 18, 2024

SOLR-13350: multi-threaded search: default to 'available processors' thread count #2569

Merged

dsmiley approved these changes Jul 22, 2024


Merge branch 'main' into SOLR-13350-fixed-not-cached-threadpool

899a4de


Update solr/core/src/java/org/apache/solr/core/CoreContainer.java

f05c311

cpoerschke merged commit dfdbf85 into apache:main Jul 23, 2024
4 checks passed

cpoerschke deleted the SOLR-13350-fixed-not-cached-threadpool branch July 23, 2024 16:42

asfgit pushed a commit that referenced this pull request Jul 23, 2024

SOLR-13350: multi-threaded search: replace cached with fixed threadpool (#2508) (cherry picked from commit dfdbf85)

cfec121
