Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

query_runner -> query_results: improve logging, handle unhandled data types #6905

Open
wants to merge 24 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 12 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 3 additions & 1 deletion redash/query_runner/athena.py
Original file line number Diff line number Diff line change
Expand Up @@ -153,7 +153,9 @@ def type(cls):
return "athena"

def _get_iam_credentials(self, user=None):
if ASSUME_ROLE:
# Use the default credentials if iam_role is not provided
# 20 is the default botocore ParamValidation: Invalid length for parameter RoleArn, value: 0, valid min length: 20
if ASSUME_ROLE and len(self.configuration.get("iam_role")) >= 20:
role_session_name = "redash" if user is None else user.email
sts = boto3.client("sts")
creds = sts.assume_role(
Expand Down
15 changes: 14 additions & 1 deletion redash/query_runner/query_results.py
Original file line number Diff line number Diff line change
Expand Up @@ -109,9 +109,13 @@ def flatten(value):
return json_dumps(value)
elif isinstance(value, decimal.Decimal):
return float(value)
elif isinstance(value, datetime.timedelta):
elif isinstance(value, (datetime.date, datetime.time, datetime.datetime, datetime.timedelta)):
return str(value)
elif value is None:
return 'NULL'
else:
if not isinstance(value, (str, float, int)):
vtatarin marked this conversation as resolved.
Show resolved Hide resolved
logger.debug("flatten() found new type: %s", str(type(value)))
return value


Expand All @@ -134,10 +138,19 @@ def create_table(connection, table_name, query_results):
column_list=column_list,
place_holders=",".join(["?"] * len(columns)),
)
logger.debug("INSERT template: %s", insert_template)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pretty sure we don't want debugging statements being unconditionally run. At least with the change above it, it seems to be done conditionally there.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@justinclift thank you for the reply! So there are 2 things:

  • logger only logs (prints to stdout/stderr) messages, nothing is actually run/executed. The default log level is INFO, so logger.debug is actually a condition which logs the data only at a more verbose level
  • for another added logger there is a condition that checks the current log level and prevents going through too many checks unless its DEBUG (which is not default)

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@vtatarin Sorry, but I'm short on time for the next few weeks.

Am getting some (Redash related) stuff deployed to a data centre, and that's taking the majority of my focus time. When that's done I'll be able to look at PRs properly. 😄


for row in query_results["rows"]:
values = [flatten(row.get(column)) for column in columns]
# try:
# for value in values:
# logger.debug("Value: %s, Type: %s", str(value), str(type(value)))
connection.execute(insert_template, values)
# except Exception as e:
# if logger.isEnabledFor(logging.DEBUG):
# for value in values:
# logger.debug("Value: %s, Type: %s", str(value), str(type(value)))
# raise Exception("Error inserting data: %s", str(e))


def prepare_parameterized_query(query, query_params):
Expand Down
3 changes: 2 additions & 1 deletion redash/serializers/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -280,6 +280,7 @@ def serialize_job(job):
JobStatus.CANCELED: 5,
JobStatus.DEFERRED: 6,
JobStatus.SCHEDULED: 7,
JobStatus.STOPPED: 8
}

job_status = job.get_status()
Expand All @@ -301,7 +302,7 @@ def serialize_job(job):
error = job.result["error"]
status = 4
else:
error = ""
error = str(job.exc_info)
result = query_result_id = job.result

return {
Expand Down
Loading