added support for YAML config of queries #56

richbenmintz · 2025-03-12T20:40:48Z

The Goal of this PR is to include support for YAML based query defintion files, for ease of source control and maintenance.

I have added support for YAML based query configuration and provided a sample yaml config file in the media section.
I have also modified file location to be a parameter and the appended the default lakehouse mount to support both pandas and file open for yaml

I have tested the code locally and it seems to work.

results of my test in following screen shot

richbenmintz · 2025-03-12T20:43:28Z

@microsoft-github-policy-service agree

DAXNoobJustin · 2025-03-13T13:35:28Z

Hey @richbenmintz,

This is a great idea! Love it.

Could you change the loading of the DAX queries to a function outside of the config cell? Hoping to keep that as simple as possible for the users.

Something like:

Config cell:

# Read DAX queries from the Excel or YAML file uploaded to the attached lakehouse
# The first column must be 'queryId' and additional columns should contain variants of the DAX query.
query_file_path = "Files/DAXQueries.xlsx"  # Path to the query file relative to the mount
query_file_mount_path = "/default"              # Mount location where the file is stored
query_worksheet_name = "DAXQueries"          # Worksheet name (for Excel files)

Helper function cell:

@log_function_calls
def load_dax_queries(file_path: str, mount_path: str, worksheet_name: str = None) -> pd.DataFrame:
    """
    Loads the DAX queries from the given file. Supports Excel and YAML formats.
    
    Args:
        file_path (str): Relative path to the query file.
        mount_path (str): The mount path where the file is stored.
        worksheet_name (str, optional): Worksheet name for Excel files.
        
    Returns:
        pd.DataFrame: DataFrame containing the DAX queries.
    """
    file_type = file_path.split('.')[-1].lower()
    full_path = f"{notebookutils.fs.getMountPath(mount_path)}/{file_path}"
    
    if file_type == 'xlsx':
        # Use the worksheet_name if provided, defaulting to the first sheet otherwise.
        return pd.read_excel(full_path, sheet_name=worksheet_name)
    elif file_type in ['yml', 'yaml']:
        with open(full_path, 'r') as f:
            data = yaml.load(f, Loader=yaml.FullLoader)
        return pd.DataFrame(data)
    else:
        raise ValueError(f"Unsupported file type: {file_type}")

Main run_dax_queries function

@log_function_calls
def run_dax_queries() -> None:
    """
    Main entry point for running all DAX queries from the Excel file.
    Manages the log table, capacity checks, and iterates over all queries and their combinations.
    """
    print("🚀 Starting all DAX queries")

    # Load the DAX queries using the configuration parameters.
    dax_queries = load_dax_queries(query_file_path, query_file_mount_path, query_worksheet_name)

richbenmintz · 2025-03-13T13:39:34Z

absolutely, great suggestion.

…

On Thu, Mar 13, 2025 at 10:35 AM Justin Martin ***@***.***> wrote: Hey @richbenmintz <https://github.com/richbenmintz>, This is a great idea! Love it. Could you change the loading of the DAX queries to a function outside of the config cell? Hoping to keep that as simple as possible for the users. Something like: Config cell: # Read DAX queries from the Excel or YAML file uploaded to the attached lakehouse # The first column must be 'queryId' and additional columns should contain variants of the DAX query. query_file_path = "Files/DAXQueries.xlsx" # Path to the query file relative to the mount query_file_mount_path = "/default" # Mount location where the file is stored query_worksheet_name = "DAXQueries" # Worksheet name (for Excel files) Helper function cell: @log_function_calls def load_dax_queries(file_path: str, mount_path: str, worksheet_name: str = None) -> pd.DataFrame: """ Loads the DAX queries from the given file. Supports Excel and YAML formats. Args: file_path (str): Relative path to the query file. mount_path (str): The mount path where the file is stored. worksheet_name (str, optional): Worksheet name for Excel files. Returns: pd.DataFrame: DataFrame containing the DAX queries. """ file_type = file_path.split('.')[-1].lower() full_path = f"{notebookutils.fs.getMountPath(mount_path)}/{file_path}" if file_type == 'xlsx': # Use the worksheet_name if provided, defaulting to the first sheet otherwise. return pd.read_excel(full_path, sheet_name=worksheet_name) elif file_type in ['yml', 'yaml']: with open(full_path, 'r') as f: data = yaml.load(f, Loader=yaml.FullLoader) return pd.DataFrame(data) else: raise ValueError(f"Unsupported file type: {file_type}") Main run_dax_queries function @log_function_calls def run_dax_queries() -> None: """ Main entry point for running all DAX queries from the Excel file. Manages the log table, capacity checks, and iterates over all queries and their combinations. """ print("🚀 Starting all DAX queries") # Load the DAX queries using the configuration parameters. dax_queries = load_dax_queries(query_file_path, query_file_mount_path, query_worksheet_name) — Reply to this email directly, view it on GitHub <#56 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AC7D67MIWQ54WB5P55RSA6T2UGCTNAVCNFSM6AAAAABY4SBAMOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMRRGI4DGNJYHA> . You are receiving this because you were mentioned.Message ID: ***@***.***> [image: DAXNoobJustin]*DAXNoobJustin* left a comment (microsoft/fabric-toolbox#56) <#56 (comment)> Hey @richbenmintz <https://github.com/richbenmintz>, This is a great idea! Love it. Could you change the loading of the DAX queries to a function outside of the config cell? Hoping to keep that as simple as possible for the users. Something like: Config cell: # Read DAX queries from the Excel or YAML file uploaded to the attached lakehouse # The first column must be 'queryId' and additional columns should contain variants of the DAX query. query_file_path = "Files/DAXQueries.xlsx" # Path to the query file relative to the mount query_file_mount_path = "/default" # Mount location where the file is stored query_worksheet_name = "DAXQueries" # Worksheet name (for Excel files) Helper function cell: @log_function_calls def load_dax_queries(file_path: str, mount_path: str, worksheet_name: str = None) -> pd.DataFrame: """ Loads the DAX queries from the given file. Supports Excel and YAML formats. Args: file_path (str): Relative path to the query file. mount_path (str): The mount path where the file is stored. worksheet_name (str, optional): Worksheet name for Excel files. Returns: pd.DataFrame: DataFrame containing the DAX queries. """ file_type = file_path.split('.')[-1].lower() full_path = f"{notebookutils.fs.getMountPath(mount_path)}/{file_path}" if file_type == 'xlsx': # Use the worksheet_name if provided, defaulting to the first sheet otherwise. return pd.read_excel(full_path, sheet_name=worksheet_name) elif file_type in ['yml', 'yaml']: with open(full_path, 'r') as f: data = yaml.load(f, Loader=yaml.FullLoader) return pd.DataFrame(data) else: raise ValueError(f"Unsupported file type: {file_type}") Main run_dax_queries function @log_function_calls def run_dax_queries() -> None: """ Main entry point for running all DAX queries from the Excel file. Manages the log table, capacity checks, and iterates over all queries and their combinations. """ print("🚀 Starting all DAX queries") # Load the DAX queries using the configuration parameters. dax_queries = load_dax_queries(query_file_path, query_file_mount_path, query_worksheet_name) — Reply to this email directly, view it on GitHub <#56 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AC7D67MIWQ54WB5P55RSA6T2UGCTNAVCNFSM6AAAAABY4SBAMOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDOMRRGI4DGNJYHA> . You are receiving this because you were mentioned.Message ID: ***@***.***>

…R#56

richbenmintz · 2025-03-13T14:17:32Z

I have updated the code as suggested

DAXNoobJustin

Looks great!

rmintz-dstrat added 2 commits March 12, 2025 17:20

added support for YAML config of queries

b3bd6ac

removed excel tetsing file

abf906c

updated code to reflect changes requested by Justin Martin based on P…

a258f72

…R#56

DAXNoobJustin approved these changes Mar 13, 2025

View reviewed changes

itsnotaboutthecell merged commit 2b24402 into microsoft:main Mar 13, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

added support for YAML config of queries #56

added support for YAML config of queries #56

Uh oh!

richbenmintz commented Mar 12, 2025

Uh oh!

richbenmintz commented Mar 12, 2025

Uh oh!

DAXNoobJustin commented Mar 13, 2025

Uh oh!

richbenmintz commented Mar 13, 2025 via email

Uh oh!

richbenmintz commented Mar 13, 2025

Uh oh!

DAXNoobJustin left a comment

Uh oh!

Uh oh!

Uh oh!

added support for YAML config of queries #56

added support for YAML config of queries #56

Uh oh!

Conversation

richbenmintz commented Mar 12, 2025

Uh oh!

richbenmintz commented Mar 12, 2025

Uh oh!

DAXNoobJustin commented Mar 13, 2025

Uh oh!

richbenmintz commented Mar 13, 2025 via email

Uh oh!

richbenmintz commented Mar 13, 2025

Uh oh!

DAXNoobJustin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!