Read data in batches from solr using cursorMark #364

Ahmed-Salah6011 · 2024-12-31T11:47:01Z

When attempting to read documents in batches using cursorMark and rows parameters with the above code sample
val solrDF = spark.read .format("solr") .option("zkHost", zookeeperHosts) .option("collection", collectionName) .option("query", configuredSolrQuery) .option("rows", batchSize) .option("cursorMark", cursorMark) .option("wt" , "json") .option("sort","MSG_REF_UK_ID asc") .load()

the returned solrDF doens't contain the nextCursorMark returned by solr in the response which doesn't allow using this to load the data from solr in batches

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Read data in batches from solr using cursorMark #364

Read data in batches from solr using cursorMark #364

Ahmed-Salah6011 commented Dec 31, 2024

Read data in batches from solr using cursorMark #364

Read data in batches from solr using cursorMark #364

Comments

Ahmed-Salah6011 commented Dec 31, 2024