Skip to content

Make Crawler queue in Azure separate from Azure results storage#591

Merged
elrayle merged 6 commits intoclearlydefined:masterfrom
ljones140:master
Aug 12, 2024
Merged

Make Crawler queue in Azure separate from Azure results storage#591
elrayle merged 6 commits intoclearlydefined:masterfrom
ljones140:master

Conversation

@ljones140
Copy link
Copy Markdown
Contributor

What

Makes the crawler queue configurable as a separate connection string from the Azure storage account.

By setting the env var connection string CRAWLER_QUEUE_AZURE_CONNECTION_STRING we can host the queues separately from the crawler.

Why

Currently anyone who has access to CD's Azure account can access the queues of anyone else who is hosting a crawler and submitting results to CD's azure storage account.

This could lead to security issues if sensitive data leaked into the crawler queues.
Anything could be added to the queue by external entities with with access to the CD storage account.

By adding a separate connection string we can host the queues separately
from the crawler
hopefully more intention revealing
@ljones140 ljones140 closed this Aug 12, 2024
@ljones140 ljones140 reopened this Aug 12, 2024
Copy link
Copy Markdown
Collaborator

@elrayle elrayle left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks

@elrayle elrayle merged commit a56c13d into clearlydefined:master Aug 12, 2024
elrayle added a commit that referenced this pull request Aug 12, 2024
Make Crawler queue in Azure separate from Azure results storage
@qtomlinson
Copy link
Copy Markdown
Collaborator

@ljones140 @elrayle Could you please also update documentation regarding the newly introduced environment variable?

@ljones140
Copy link
Copy Markdown
Contributor Author

@ljones140 @elrayle Could you please also update documentation regarding the newly introduced environment variable

Apologies for missing that, I will create a new PR

@ljones140
Copy link
Copy Markdown
Contributor Author

@qtomlinson PR on the operations documentation for the new env var clearlydefined/operations#88

@qtomlinson
Copy link
Copy Markdown
Collaborator

@ljones140 Thank you for keeping the docs up to date!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants