How to optimize a query with a large number of hops #249

LianxinGao · 2022-08-11T01:26:18Z

Like cypher:

MATCH (:Country)<-[:IS_PART_OF]-(:City)<-[:IS_LOCATED_IN]-(:Person)<-[:HAS_MEMBER]-(:Forum)-[:CONTAINER_OF]->(:Post)<-[:REPLY_OF]-(:Comment)-[:HAS_TAG]->(:Tag)-[:HAS_TYPE]->(:TagClass)
RETURN count(*) AS count

This query has 7 hops, if each hop has 100 data, then total data will be (100)^7， it's a huge amount of data. Besides, currently, we will deserialize all the data's properties, so OOM will happen.....

The text was updated successfully, but these errors were encountered:

LianxinGao · 2022-08-11T01:45:08Z

My idea is:

optimize the query in lynx, if we only need to get the count number, then when move to the next hop, we can drop the previous data?
optimize the data structure, use lazy node and lazy relationship( do not deserialize the properties, just use the id...)

@chunyang-wen @zyang-tudb @xuanying2020 @terrytangyuan do your have any good ideas？

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to optimize a query with a large number of hops #249

How to optimize a query with a large number of hops #249

LianxinGao commented Aug 11, 2022

LianxinGao commented Aug 11, 2022 •

edited

Loading

How to optimize a query with a large number of hops #249

How to optimize a query with a large number of hops #249

Comments

LianxinGao commented Aug 11, 2022

LianxinGao commented Aug 11, 2022 • edited Loading

LianxinGao commented Aug 11, 2022 •

edited

Loading