On our SaaS offering, we collect several identifiers with each event tracked through Snowplow so that we can follow user journeys. If an identifier could be used to personally identify a user, we pseudonymize it by hashing at collection time.

| Identifier | Pseudonymized | Example data | Pseudonymized data |
|---|---|---|---|
| user_id | Yes | 2890431 | Long hash |
| project_id | No | 27005757 | 27005757 |
| namespace_id | No | 12174719 | 12174719 |
| page_url | Yes | https://gitlab.com/my-group/my-awesome-project | https://gitlab.com/group:123/project:356 |
| page_url | Yes | https://gitlab.com/my-group/my-awesome-project/some_folder/some_file.js | https://gitlab.com/group:123/project:346/:repository_path |
| page_url | Yes | https://gitlab.com/my-group/issues | https://gitlab.com/group:123/issues |
| page_url | Yes | https://gitlab.com/checkout | https://gitlab.com/checkout |
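
The hashing step for an identifier such as `user_id` could look roughly like the sketch below. This is an illustration only, not the actual implementation: the choice of HMAC-SHA256, the `SECRET_KEY` value, and the names `HASHED_FIELDS`, `pseudonymize`, and `pseudonymize_event` are all assumptions made for this example. The path masking applied to `page_url` is a separate transformation and is not covered here.

```python
import hashlib
import hmac

# Hypothetical secret for illustration; the text above does not say
# whether a key or salt is used, or which hash function is applied.
SECRET_KEY = b"example-secret"

# Fields hashed in this sketch. Per the table, project_id and
# namespace_id are collected as-is, so they are not listed.
HASHED_FIELDS = {"user_id"}


def pseudonymize(value) -> str:
    """Return a hex HMAC-SHA256 digest of the value's string form."""
    message = str(value).encode("utf-8")
    return hmac.new(SECRET_KEY, message, hashlib.sha256).hexdigest()


def pseudonymize_event(event: dict) -> dict:
    """Hash only the fields in HASHED_FIELDS; pass the rest through."""
    return {
        key: pseudonymize(value) if key in HASHED_FIELDS else value
        for key, value in event.items()
    }


event = {"user_id": 2890431, "project_id": 27005757, "namespace_id": 12174719}
pseudonymized = pseudonymize_event(event)
```

In this sketch, `pseudonymized["user_id"]` is a 64-character hex string, while `project_id` and `namespace_id` keep their original values. Because the digest is deterministic for a given key, the same user maps to the same hash across events, which is what makes it possible to follow a journey without storing the raw ID.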