Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: Union and Union All Functionality for Combining Multiple Cubes #8143

Open
ZiggiZagga opened this issue Apr 14, 2024 · 0 comments

Comments

@ZiggiZagga
Copy link

Feature Request

Summary
Introduce a union and union_all key in the Cube configuration to allow for intuitive data blending across multiple cubes.

Motivation
Currently, combining data from multiple cubes requires custom query orchestration and application-layer code, which can be complex and time-consuming. By introducing a built-in union and union_all functionality, Cube can furthermore provide a more user-friendly approach to data blending, making it accessible to a wider range of users.

Detailed Explanation
The proposed feature would enable users to specify a list of cubes that should be combined using a new key in the Cube configuration file. Alongside this, users can define the dimensions that are common across these cubes, which are necessary for the union operation.

Here's an example of how the configuration might look:

union_all:
  - cube_list:
      - SalesCube
      - InventoryCube
      - ShippingCube
    union_dimensions:
      - date
      - product_id

In this configuration:

  • cube_list contains the names of the cubes to be combined.
  • union_dimensions lists the common dimensions to align the data from these cubes.

Benefits

  • Simplifies the process of combining data from multiple cubes.
  • Reduces the need for complex custom code in the application layer.
  • Makes data blending more accessible to non-technical users.

Considerations

  • The feature would need to handle the underlying data models and relationships between cubes carefully.
  • Performance implications of combining large datasets should be taken into account.

Potential Challenges

  • Ensuring the feature is flexible enough to handle various data modeling scenarios.
  • Maintaining the performance and efficiency of data retrieval and analysis.

I believe this feature would be a valuable addition to Cube, streamlining the data blending process and enhancing the overall user experience. I look forward to the community's thoughts on this proposal.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant