-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GSoC Main Code Deliverables #54
Conversation
- Made a new pointer fitting function that is now able to recursively change parent pointers should the short delta not be enough. - This new function : `_try_fit_child_ptr` now replaces `_break_chunk_from`. - Tree is now able to handle larger sizes. - Will refactor other functions to use this helper now. - It reduces a bunch of code duplication.
… test suite which is for the `delete` methods.
…iverables have been added. Will benchmark the delete methods too. Independently found issues in the lhtree `delete_leaf` method.
WalkthroughThe changes involve updates to multiple files within the project, focusing on enhancements to tree data structure functionality, performance benchmarking, and correctness testing. New benchmark tests for various tree operations were added, alongside modifications to existing tests and files. Additionally, the Changes
Sequence Diagram(s)sequenceDiagram
participant User
participant Benchmark
participant Tree
User->>Benchmark: Initiate Benchmarking
Benchmark->>Tree: Construct Tree
Tree-->>Benchmark: Return Tree Structure
Benchmark->>Tree: Perform Traversals
Tree-->>Benchmark: Return Traversal Results
Benchmark-->>User: Display Performance Metrics
Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media? TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (invoked as PR comments)
Additionally, you can add CodeRabbit Configuration File (
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 6
Outside diff range, codebase verification and nitpick comments (28)
hhds/tests/wide_tree_correctness.cpp (2)
4-4
: Consider removing unused profiler header.The
#include <gperftools/profiler.h>
is currently not used in the code. Consider removing it unless profiling is planned for future use.
66-77
: Consider removing commented-out code.The commented-out profiling and print statements can be removed to clean up the code unless they are intended for future use.
hhds/tests/traversal_bench/preorder_bench/wide_tree_bench.cpp (2)
14-18
: Consider adding comments for utility functions.Adding comments to explain the purpose and usage of utility functions like
generate_random_int
can improve code readability and maintainability.
37-38
: Consider using a seed for the random generator.Using a fixed seed for the random generator can help achieve consistent results across different runs, which is useful for benchmarking.
hhds/tests/add_append_bench/wide_tree_bench.cpp (3)
13-34
: Remove or document commented-out code.The file contains a large block of commented-out code for a custom memory manager. Consider removing it if it's not needed, or add comments explaining its purpose and why it's commented out.
40-44
: Consider adding comments for utility functions.Adding comments to explain the purpose and usage of utility functions like
generate_random_int
can improve code readability and maintainability.
36-38
: Consider using a seed for the random generator.Using a fixed seed for the random generator can help achieve consistent results across different runs, which is useful for benchmarking.
hhds/tests/chip_typical_long_correctness.cpp (2)
68-85
: Consider adding comments for leaf collection functions.Adding comments to explain the purpose and usage of the
collect_leaves_hhds
andcollect_leaves_lhtree
functions can improve code readability and maintainability.
178-201
: Remove or document commented-out code.The file contains a block of commented-out code for leaf collection and deletion tests. Consider removing it if it's not needed, or add comments explaining its purpose and why it's commented out.
hhds/tests/add_append_bench/chip_typical_tree_bench.cpp (4)
14-18
: Consider adding error handling for random number generation.While the current implementation is straightforward, consider adding error handling or validation to ensure that
min
is less than or equal tomax
to prevent potential issues.int generate_random_int(std::default_random_engine& generator, int min, int max) { + if (min > max) { + throw std::invalid_argument("min should be less than or equal to max"); + } std::uniform_int_distribution<int> distribution(min, max); return distribution(generator); }
20-46
: Optimize tree building functions by removing unused variables.The
level_data
variable is declared but never used. Consider removing it to clean up the code.for (int depth = 0; depth < depth_val; ++depth) { std::vector<hhds::Tree_pos> hhds_next_level; - std::vector<std::vector<int>> level_data;
48-74
: Optimize tree building functions by removing unused variables.Similarly, in the
build_lh_tree
function, thelevel_data
variable is declared but never used. Consider removing it to clean up the code.for (int depth = 0; depth < depth_val; ++depth) { std::vector<lh::Tree_index> lh_next_level; - std::vector<std::vector<int>> level_data;
204-220
: Consider using consistent iteration counts for benchmarks.The iteration counts for the benchmarks are inconsistent. Consider standardizing them for a more uniform performance analysis.
BENCHMARK(test_chip_typical_tree_6_hhds); BENCHMARK(test_chip_typical_tree_6_lh); -BENCHMARK(test_chip_typical_tree_7_hhds)->Iterations(5); -BENCHMARK(test_chip_typical_tree_7_lh)->Iterations(5); -BENCHMARK(test_chip_typical_tree_8_hhds)->Iterations(5); -BENCHMARK(test_chip_typical_tree_8_lh)->Iterations(5); +BENCHMARK(test_chip_typical_tree_7_hhds); +BENCHMARK(test_chip_typical_tree_7_lh); +BENCHMARK(test_chip_typical_tree_8_hhds); +BENCHMARK(test_chip_typical_tree_8_lh);hhds/tests/add_append_bench/chip_typical_long_tree_bench.cpp (4)
14-18
: Consider adding error handling for random number generation.As in the previous file, consider adding error handling or validation to ensure that
min
is less than or equal tomax
to prevent potential issues.int generate_random_int(std::default_random_engine& generator, int min, int max) { + if (min > max) { + throw std::invalid_argument("min should be less than or equal to max"); + } std::uniform_int_distribution<int> distribution(min, max); return distribution(generator); }
20-46
: Optimize tree building functions by removing unused variables.The
level_data
variable is declared but never used. Consider removing it to clean up the code.for (int depth = 0; depth < depth_val; ++depth) { std::vector<hhds::Tree_pos> hhds_next_level; - std::vector<std::vector<int>> level_data;
48-74
: Optimize tree building functions by removing unused variables.Similarly, in the
build_lh_tree
function, thelevel_data
variable is declared but never used. Consider removing it to clean up the code.for (int depth = 0; depth < depth_val; ++depth) { std::vector<lh::Tree_index> lh_next_level; - std::vector<std::vector<int>> level_data;
204-220
: Consider using consistent iteration counts for benchmarks.The iteration counts for the benchmarks are inconsistent. Consider standardizing them for a more uniform performance analysis.
BENCHMARK(test_chip_typical_long_tree_6_hhds)->Iterations(3); BENCHMARK(test_chip_typical_long_tree_6_lh)->Iterations(3); -BENCHMARK(test_chip_typical_long_tree_7_hhds)->Iterations(4); -BENCHMARK(test_chip_typical_long_tree_7_lh)->Iterations(4); -BENCHMARK(test_chip_typical_long_tree_8_hhds); -BENCHMARK(test_chip_typical_long_tree_8_lh); +BENCHMARK(test_chip_typical_long_tree_7_hhds)->Iterations(3); +BENCHMARK(test_chip_typical_long_tree_7_lh)->Iterations(3); +BENCHMARK(test_chip_typical_long_tree_8_hhds)->Iterations(3); +BENCHMARK(test_chip_typical_long_tree_8_lh)->Iterations(3);hhds/tests/traversal_bench/preorder_bench/chip_typical_tree_bench.cpp (3)
14-18
: Consider adding error handling for random number generation.As in the previous files, consider adding error handling or validation to ensure that
min
is less than or equal tomax
to prevent potential issues.int generate_random_int(std::default_random_engine& generator, int min, int max) { + if (min > max) { + throw std::invalid_argument("min should be less than or equal to max"); + } std::uniform_int_distribution<int> distribution(min, max); return distribution(generator); }
20-46
: Optimize tree building functions by removing unused variables.The
level_data
variable is declared but never used. Consider removing it to clean up the code.for (int depth = 0; depth < depth_val; ++depth) { std::vector<hhds::Tree_pos> hhds_next_level; - std::vector<std::vector<int>> level_data;
48-74
: Optimize tree building functions by removing unused variables.Similarly, in the
build_lh_tree
function, thelevel_data
variable is declared but never used. Consider removing it to clean up the code.for (int depth = 0; depth < depth_val; ++depth) { std::vector<lh::Tree_index> lh_next_level; - std::vector<std::vector<int>> level_data;
hhds/temptree.cpp (8)
92-104
: Handle invalid indices more robustly.The
_get_first_child_s
method returnsINVALID
for invalid indices. Consider throwing an exception or logging an error to handle this case more explicitly.- default: return INVALID; + default: throw std::out_of_range("_get_first_child_s: Invalid index for first_child_s");
107-120
: Handle invalid indices more robustly.The
_set_first_child_s
method breaks for invalid indices without any action. Consider throwing an exception or logging an error to handle this case more explicitly.- default: - break; + default: throw std::out_of_range("_set_first_child_s: Invalid index for first_child_s");
124-136
: Handle invalid indices more robustly.The
_get_last_child_s
method returnsINVALID
for invalid indices. Consider throwing an exception or logging an error to handle this case more explicitly.- default: return INVALID; + default: throw std::out_of_range("_get_last_child_s: Invalid index for last_child_s");
139-152
: Handle invalid indices more robustly.The
_set_last_child_s
method breaks for invalid indices without any action. Consider throwing an exception or logging an error to handle this case more explicitly.- default: - break; + default: throw std::out_of_range("_set_last_child_s: Invalid index for last_child_s");
251-268
: Consider re-enabling exception handling.The exception handling code is commented out. Consider re-enabling it to handle cases where the tree size is exceeded or the parent index is out of range.
- // if (pointers_stack.size() >= MAX_TREE_SIZE) { - // throw std::out_of_range("_create_space: Tree size exceeded"); - // } else if (!_check_idx_exists(parent_index)) { - // throw std::out_of_range("_create_space: Parent index out of range"); - // }
271-293
: Consider re-enabling exception handling.The exception handling code is commented out. Consider re-enabling it to handle cases where the tree size is exceeded or the current index is out of range.
- // if (pointers_stack.size() >= MAX_TREE_SIZE) { - // throw std::out_of_range("_insert_chunk_after: Tree size exceeded"); - // } else if (!_check_idx_exists(curr)) { - // throw std::out_of_range("_insert_chunk_after: Current index out of range"); - // }
311-341
: Consider re-enabling exception handling.The exception handling code is commented out. Consider re-enabling it to handle cases where the parent or child index is out of range.
- // if (!_check_idx_exists(parent_id) || !_check_idx_exists(child_id)) { - // throw std::out_of_range("_try_fit_child_ptr: Index out of range"); - // }
1348-1353
: Remove or convert debug statements to logging.The debug statements are commented out. Consider removing them or converting them to proper logging for better maintainability.
- // std::cout << "BEFORE DELETTING : " << leaf_index << " ... \n"; - // for (short offset = 0; offset < CHUNK_SIZE; offset++) { - // std::cout << data_stack[(leaf_chunk_id << CHUNK_SHIFT) + offset].value_or(-1) << " "; - // } - // std::cout << "------------------------------------------------------\n";
Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Files ignored due to path filters (2)
graph1_dbg.png
is excluded by!**/*.png
graph_tiny1_dbg.png
is excluded by!**/*.png
Files selected for processing (17)
- .gitignore (1 hunks)
- hhds/BUILD (4 hunks)
- hhds/temptree.cpp (1 hunks)
- hhds/tests/add_append_bench/chip_typical_long_tree_bench.cpp (1 hunks)
- hhds/tests/add_append_bench/chip_typical_tree_bench.cpp (1 hunks)
- hhds/tests/add_append_bench/deep_tree_bench.cpp (1 hunks)
- hhds/tests/add_append_bench/wide_tree_bench.cpp (1 hunks)
- hhds/tests/chip_typical_correctness.cpp (6 hunks)
- hhds/tests/chip_typical_long_correctness.cpp (5 hunks)
- hhds/tests/deep_tree_correctness.cpp (1 hunks)
- hhds/tests/traversal_bench/preorder_bench/chip_typical_long_tree_bench.cpp (1 hunks)
- hhds/tests/traversal_bench/preorder_bench/chip_typical_tree_bench.cpp (1 hunks)
- hhds/tests/traversal_bench/preorder_bench/deep_tree_bench.cpp (1 hunks)
- hhds/tests/traversal_bench/preorder_bench/wide_tree_bench.cpp (1 hunks)
- hhds/tests/wide_tree_correctness.cpp (2 hunks)
- hhds/tree.hpp (26 hunks)
- hhds/tree_del.hpp (1 hunks)
Files skipped from review due to trivial changes (2)
- .gitignore
- hhds/tests/deep_tree_correctness.cpp
Additional context used
Learnings (1)
hhds/tree_del.hpp (1)
Learnt from: renau PR: masc-ucsc/hhds#51 File: hhds/tree_del.hpp:260-277 Timestamp: 2024-08-22T18:53:59.754Z Learning: For optimizing space allocation in data structures, prefer using `resize` over a loop to add multiple elements.
Additional comments not posted (66)
hhds/tests/wide_tree_correctness.cpp (1)
6-7
: LGTM! Header paths changed to absolute.The change from relative to absolute paths for header files is approved as it may improve clarity and maintainability.
hhds/tests/add_append_bench/deep_tree_bench.cpp (6)
1-11
: LGTM! Includes and random engine initialization.The includes and random engine initialization are appropriate and correctly implemented.
13-17
: LGTM! Utility function for random integer generation.The
generate_random_int
function is correctly implemented and useful for generating random data for the benchmarks.
19-35
: LGTM! Tree-building functions.The
build_hhds_tree
andbuild_lh_tree
functions are correctly implemented and serve their purpose in the benchmark tests.
37-147
: LGTM! Comprehensive benchmark tests for varying tree depths.The benchmark tests cover a wide range of tree depths, providing valuable insights into performance.
149-163
: LGTM! Benchmark registration.The benchmarks are correctly registered using the
BENCHMARK
macro.
165-166
: LGTM! Benchmark main function.The use of
BENCHMARK_MAIN()
is appropriate for executing the benchmarks.hhds/BUILD (1)
Line range hint
116-194
: LGTM! New and modifiedcc_test
definitions.The new and modified
cc_test
definitions enhance the testing framework by expanding the range of tests and refining existing ones.hhds/tests/traversal_bench/preorder_bench/wide_tree_bench.cpp (1)
183-200
: Ensure all benchmark tests are necessary.The file contains benchmark tests for trees with a wide range of node counts. Verify if all these tests are necessary and provide meaningful insights. Reducing the number of tests can save time and resources during benchmarking.
hhds/tests/add_append_bench/wide_tree_bench.cpp (1)
175-191
: Ensure all benchmark tests are necessary.The file contains benchmark tests for trees with a wide range of node counts. Verify if all these tests are necessary and provide meaningful insights. Reducing the number of tests can save time and resources during benchmarking.
hhds/tests/chip_typical_long_correctness.cpp (2)
5-6
: Verify include paths.Ensure that the include paths for
tree.hpp
andlhtree.hpp
are correct and necessary for the file's functionality.
Line range hint
101-129
: Ensure loop depth and child range are appropriate.The loop depth and child range in
test_chip_tree
have been modified. Verify that these changes are appropriate for the intended testing scenarios and do not introduce unintended issues.hhds/tests/traversal_bench/preorder_bench/chip_typical_long_tree_bench.cpp (8)
10-18
: LGTM: Utility functions and random generator.The utility functions and random number generation are implemented correctly.
20-46
: LGTM: HHDS tree building logic.The logic for building the hhds tree is sound and correctly implemented.
48-74
: LGTM: LH tree building logic.The logic for building the lh tree is sound and correctly implemented.
76-84
: LGTM: Preorder traversal for HHDS tree.The preorder traversal logic for the hhds tree is correctly implemented.
86-96
: LGTM: Preorder traversal for LH tree.The preorder traversal logic for the lh tree is correctly implemented.
98-240
: LGTM: Benchmark tests for trees with 1 to 8 nodes.The benchmark tests for trees with 1 to 8 nodes are correctly implemented.
242-257
: LGTM: Benchmark registration.The benchmarks are registered correctly. Consider verifying the commented-out benchmarks to ensure they are not needed.
258-259
: LGTM: Benchmark main function.The
BENCHMARK_MAIN
macro is correctly placed for running the benchmarks.hhds/tests/traversal_bench/preorder_bench/deep_tree_bench.cpp (9)
10-18
: LGTM: Utility functions and random generator.The utility functions and random number generation are implemented correctly.
20-27
: LGTM: HHDS tree building logic.The logic for building the hhds tree is sound and correctly implemented.
29-36
: LGTM: LH tree building logic.The logic for building the lh tree is sound and correctly implemented.
38-46
: LGTM: Preorder traversal for HHDS tree.The preorder traversal logic for the hhds tree is correctly implemented.
61-71
: LGTM: Preorder traversal for LH tree.The preorder traversal logic for the lh tree is correctly implemented.
89-96
: LGTM: Postorder traversal for HHDS tree.The postorder traversal logic for the hhds tree is correctly implemented.
98-107
: LGTM: Postorder traversal for LH tree.The postorder traversal logic for the lh tree is correctly implemented.
109-249
: LGTM: Benchmark tests for deep trees.The benchmark tests for deep trees are correctly implemented.
251-252
: LGTM: Benchmark main function.The
BENCHMARK_MAIN
macro is correctly placed for running the benchmarks.hhds/tests/chip_typical_correctness.cpp (6)
Line range hint
8-18
: LGTM: Random number generator and utility function.The random number generator and utility function are correctly implemented. Consider verifying the impact of the seed change on test outcomes.
Line range hint
22-35
: LGTM: Preorder traversal for HHDS tree with sibling data.The preorder traversal logic for the hhds tree, along with sibling data collection, is correctly implemented.
Line range hint
44-58
: LGTM: Preorder traversal for LH tree with sibling data.The preorder traversal logic for the lh tree, along with sibling data collection, is correctly implemented.
73-90
: LGTM: Leaf collection functions for HHDS and LH trees.The leaf collection functions are correctly implemented and provide useful functionality.
Line range hint
91-243
: LGTM: Chip tree correctness test.The chip tree correctness test is comprehensive and correctly implemented. The print statements aid in debugging.
Line range hint
246-250
: LGTM: Main function.The main function is straightforward and correctly implemented.
hhds/tree.hpp (18)
102-103
: Consider handling out-of-range indices more explicitly.Returning
INVALID
instead of throwing an exception may lead to silent failures if an invalid index is accessed. Consider logging a warning or using assertions to catch these cases during development.
117-118
: Consider handling out-of-range indices more explicitly.Breaking out of the control flow without any error indication may lead to silent failures. Consider logging a warning or using assertions to catch these cases during development.
133-134
: Consider handling out-of-range indices more explicitly.Returning
INVALID
instead of throwing an exception may lead to silent failures if an invalid index is accessed. Consider logging a warning or using assertions to catch these cases during development.
148-149
: Consider handling out-of-range indices more explicitly.Breaking out of the control flow without any error indication may lead to silent failures. Consider logging a warning or using assertions to catch these cases during development.
240-242
: Ensure indices are non-negative before calling_check_idx_exists
.The function assumes that indices are non-negative. Verify that this assumption holds in all calling code.
249-263
: Verify parent-child relationship integrity.The removal of parent index checks simplifies the function. Ensure that the parent-child relationship is maintained correctly in the calling code.
267-275
: Ensurecurr
is valid before calling_insert_chunk_after
.The function now takes a reference for
curr
, which improves performance. Verify thatcurr
is always valid in the calling code.
Line range hint
307-391
: Ensureparent_id
andchild_id
are valid before calling_try_fit_child_ptr
.The function now takes references for
parent_id
andchild_id
, which improves performance. Verify that both are valid in the calling code.
414-443
: Verify the usage and efficiency ofdelete_subtree
.The new method uses level-order traversal to delete a subtree. Ensure that it is used correctly and efficiently in the codebase.
426-428
: Consider handling invalid indices more explicitly inget_data
.Accessing data without checks may lead to undefined behavior if the index is invalid. Consider adding assertions or logging to catch these cases during development.
434-436
: Consider handling invalid indices more explicitly inget_data
(const version).Accessing data without checks may lead to undefined behavior if the index is invalid. Consider adding assertions or logging to catch these cases during development.
442-444
: Consider handling invalid indices more explicitly inset_data
.Setting data without checks may lead to undefined behavior if the index is invalid. Consider adding assertions or logging to catch these cases during development.
923-925
: Consider handling invalid indices more explicitly inget_parent
.Accessing parent without checks may lead to undefined behavior if the index is invalid. Consider adding assertions or logging to catch these cases during development.
940-942
: Consider handling invalid indices more explicitly inis_leaf
.Checking leaf status without checks may lead to undefined behavior if the index is invalid. Consider adding assertions or logging to catch these cases during development.
962-964
: Consider handling invalid indices more explicitly inget_last_child
.Accessing last child without checks may lead to undefined behavior if the index is invalid. Consider adding assertions or logging to catch these cases during development.
1019-1021
: Consider handling invalid indices more explicitly inget_first_child
.Accessing first child without checks may lead to undefined behavior if the index is invalid. Consider adding assertions or logging to catch these cases during development.
1052-1054
: Consider handling invalid indices more explicitly inis_last_child
.Checking last child status without checks may lead to undefined behavior if the index is invalid. Consider adding assertions or logging to catch these cases during development.
1088-1090
: Consider handling invalid indices more explicitly inis_first_child
.Checking first child status without checks may lead to undefined behavior if the index is invalid. Consider adding assertions or logging to catch these cases during development.
hhds/temptree.cpp (4)
63-91
: Verify bit field usage and alignment.The use of bit fields can lead to unexpected behavior if not handled carefully, especially with alignment and packing. Ensure that the alignment and packing are correctly applied and that the bit fields do not exceed the intended size.
498-538
: Verify edge case handling for iterators.Ensure that the
sibling_order_iterator
correctly handles edge cases, such as when the current node is invalid.
627-686
: Verify edge case handling for iterators.Ensure that the
pre_order_iterator
correctly handles edge cases, such as when the current node is invalid.
783-823
: Verify edge case handling for iterators.Ensure that the
post_order_iterator
correctly handles edge cases, such as when the current node is invalid.hhds/tree_del.hpp (9)
280-302
: LGTM!The method
_insert_chunk_after
is correctly implemented and follows the logic of updating sibling pointers.
320-413
: LGTM!The method
_try_fit_child_ptr
is well-implemented, handling both long and short pointers effectively.
363-447
: LGTM!The method
delete_leaf
is comprehensive and handles sibling and parent pointer updates effectively.
457-487
: LGTM!The method
delete_subtree
efficiently collects nodes for deletion and ensures leaves are deleted first.
1219-1252
: LGTM!The method
append_sibling
correctly handles cases where a new chunk is needed and updates parent pointers.
1264-1289
: LGTM!The method
insert_next_sibling
efficiently checks for available space and handles chunk insertion if needed.
1300-1321
: LGTM!The method
add_root
ensures the tree is empty before adding a root and correctly initializes the data and pointers stacks.
1333-1351
: LGTM!The method
add_child
handles both appending a sibling and creating a new child correctly.
982-1032
: LGTM!The method
get_last_child
efficiently uses chunk offsets and short deltas to determine the last child.
void build_hhds_tree(hhds::tree<int>& hhds_tree, int num_nodes) { | ||
auto hhds_root = hhds_tree.add_root(generate_random_int(generator, 1, 100)); | ||
|
||
for (int i = 0; i < num_nodes; ++i) { | ||
hhds_tree.add_child(hhds_root, generate_random_int(generator, 1, 100)); | ||
} | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Optimize tree-building loops.
The loops in build_hhds_tree
and build_lh_tree
could be optimized by preallocating space for the children if the number of nodes is known in advance. This can improve performance slightly by reducing dynamic memory allocations.
void build_hhds_tree(hhds::tree<int>& hhds_tree, int num_nodes) { | ||
auto hhds_root = hhds_tree.add_root(generate_random_int(generator, 1, 100)); | ||
|
||
for (int i = 0; i < num_nodes; ++i) { | ||
hhds_tree.add_child(hhds_root, generate_random_int(generator, 1, 100)); | ||
} | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Optimize tree-building loops.
The loops in build_hhds_tree
and build_lh_tree
could be optimized by preallocating space for the children if the number of nodes is known in advance. This can improve performance slightly by reducing dynamic memory allocations.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The benchmark intends to test and compare the performance of both trees while build appending, preallocating the space would be breaking the rules of the test.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ujjwal-shekhar, thanks for clarifying the intent behind the benchmark. I'll remember that preallocating space would alter the nature of the test, which is designed to evaluate the performance of tree-building operations as they naturally occur.
(\(\
( -.-)
o_(")(")
Learnings added
Learnt from: ujjwal-shekhar
PR: masc-ucsc/hhds#54
File: hhds/tests/add_append_bench/wide_tree_bench.cpp:46-52
Timestamp: 2024-08-23T19:56:08.107Z
Learning: In the `wide_tree_bench.cpp` benchmark, preallocating space for tree nodes is not performed to maintain the integrity of the performance test, which aims to evaluate tree-building operations as they naturally occur.
Note: Learnings are effective only in the context of similar code segments. To apply general coding guidelines, please configure review instructions. You can manage existing learnings in the app.
default: return INVALID; | ||
// throw std::out_of_range("_get_first_child_s: Invalid index for first_child_s"); | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Consider reintroducing exception handling.
The commented-out exception throws in the helper functions could provide useful error handling for invalid indices.
Consider reintroducing these exceptions to improve robustness:
default: throw std::out_of_range("_get_first_child_s: Invalid index for first_child_s");
Also applies to: 122-124, 138-140, 153-155
// if (pointers_stack.size() >= MAX_TREE_SIZE) { | ||
// throw std::out_of_range("_create_space: Tree size exceeded"); | ||
// } else if (!_check_idx_exists(parent_index)) { | ||
// throw std::out_of_range("_create_space: Parent index out of range"); | ||
// } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reintroduce exception handling for robustness.
The commented-out exception throws in various methods could enhance error handling and robustness.
Consider reintroducing these exceptions to handle out-of-range errors effectively.
Also applies to: 281-285, 322-324, 447-449, 455-457, 464-465
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Exception handling has been removed on purpose after the -c opt
flag was used in benchmarking. Since the compiler cannot make decent optimizations due to stack unwinding issues. Maybe iassert or other things can be used instead of exceptions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ujjwal-shekhar, thank you for the clarification. It's understandable that exception handling can interfere with compiler optimizations, especially when using the -c opt
flag. Exploring alternatives like iassert
or other lightweight error handling mechanisms could be a good approach to maintain robustness without compromising performance. If you need further assistance in implementing these alternatives, feel free to ask!
Learnings added
Learnt from: ujjwal-shekhar
PR: masc-ucsc/hhds#54
File: hhds/tree_del.hpp:261-265
Timestamp: 2024-08-23T19:55:04.894Z
Learning: Exception handling was removed in the `hhds/tree_del.hpp` file to allow for better compiler optimizations during benchmarking with the `-c opt` flag. Alternatives like `iassert` are being considered for error handling.
Note: Learnings are effective only in the context of similar code segments. To apply general coding guidelines, please configure review instructions. You can manage existing learnings in the app.
data_stack.emplace_back(data); | ||
for (int i = 0; i < CHUNK_MASK; i++) { | ||
data_stack.emplace_back(); | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Optimize space allocation using resize
.
Instead of using a loop to add multiple elements, consider using resize
for better performance and readability.
Apply this change to optimize space allocation:
data_stack.resize(data_stack.size() + CHUNK_MASK + 1);
Optimize space allocation using resize
.
Instead of using a loop to add multiple elements, consider using resize
for better performance and readability.
Apply this change to optimize space allocation:
data_stack.resize(data_stack.size() + CHUNK_MASK + 1);
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The tree_del.hpp
file has a version of the tree that also adds unsigned short last_occupied : CHUNK_SHIFT;
to the Tree_pointers chunk (appropriately adjusting SHORT_DELTA and CHUNK sizes to fit within 512 bytes). This was done to experiment and compare it's performance against finding the next occupied chunk using brute force.
This version was only left for future contributors to refer to or experiment with, it is not what is being benchmarked or used at any point, so making any optimizations to this version of the tree is not as useful.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ujjwal-shekhar, thank you for the clarification. I'll make a note that the tree_del.hpp
file is an experimental version meant for future contributors and not actively used or benchmarked. Therefore, optimizations in this file are not necessary.
Learnings added
Learnt from: ujjwal-shekhar
PR: masc-ucsc/hhds#54
File: hhds/tree_del.hpp:268-271
Timestamp: 2024-08-23T19:53:49.198Z
Learning: The `tree_del.hpp` file contains an experimental version of the tree structure, which is not actively used or benchmarked. It is meant for future contributors to refer to or experiment with.
Note: Learnings are effective only in the context of similar code segments. To apply general coding guidelines, please configure review instructions. You can manage existing learnings in the app.
Learnings used
Learnt from: renau
PR: masc-ucsc/hhds#51
File: hhds/tree_del.hpp:260-277
Timestamp: 2024-08-22T18:53:59.754Z
Learning: For optimizing space allocation in data structures, prefer using `resize` over a loop to add multiple elements.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 0
Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Files selected for processing (7)
- hhds/tests/add_append_bench/wide_tree_bench.cpp (1 hunks)
- hhds/tests/chip_typical_correctness.cpp (5 hunks)
- hhds/tests/chip_typical_long_correctness.cpp (3 hunks)
- hhds/tests/deep_tree_correctness.cpp (2 hunks)
- hhds/tests/traversal_bench/preorder_bench/chip_typical_long_tree_bench.cpp (1 hunks)
- hhds/tests/traversal_bench/preorder_bench/deep_tree_bench.cpp (1 hunks)
- hhds/tests/wide_tree_correctness.cpp (3 hunks)
Files skipped from review due to trivial changes (2)
- hhds/tests/deep_tree_correctness.cpp
- hhds/tests/wide_tree_correctness.cpp
Files skipped from review as they are similar to previous changes (5)
- hhds/tests/add_append_bench/wide_tree_bench.cpp
- hhds/tests/chip_typical_correctness.cpp
- hhds/tests/chip_typical_long_correctness.cpp
- hhds/tests/traversal_bench/preorder_bench/chip_typical_long_tree_bench.cpp
- hhds/tests/traversal_bench/preorder_bench/deep_tree_bench.cpp
Updated relative paths in all files.
Summary by CodeRabbit
New Features
Bug Fixes
Documentation
Chores