Deanonymization Risk Assessment Service

In this page you can have a detailed description of the Deanonymization Risk Assessment. Follow the instructions to experiment with the service through the Secured Innohub.

Description


The De-Anonymization Risk Assessment Decision Service is a web-based tool that allows data owners to evaluate the re-identification risk and privacy of anonymized datasets.

As a healthcare professional or data owner, you handle sensitive patient information daily. Protecting this data is not only a legal obligation under frameworks like GDPR and HIPAA but also a moral imperative to maintain patient trust. However, even when data is anonymized, there's a risk that individuals can be re-identified through sophisticated techniques. Understanding and mitigating these risks is crucial in today's data-driven world.


Our Data De-Anonymization Risk Assessment Tool is a web-based application designed to help you evaluate the re-identification risk and privacy of anonymized datasets. This tool is developed using Python and the Flask web framework for the backend, and employs HTML, CSS, and JavaScript for the frontend interface. It leverages advanced algorithms and methodologies derived from our extensive research, including a comprehensive summary review of de-anonymization attacks specifically targeting health data. By integrating documented attack vectors and known vulnerabilities into our risk assessment models, the tool provides a detailed evaluation of potential re-identification risks. It empowers users to identify potential threats by evaluating key dataset characteristics and cross-referencing them with known attack methods.

  • Framework and Languages: Developed using Python and Flask for the backend server, with HTML, CSS, and JavaScript for the frontend interface.
  • Risk Assessment Algorithms: The risk calculations are based on mathematical models that consider various dataset parameters, such as the uniqueness of data points, the presence of quasi-identifiers, and potential linkage opportunities.
  • Visualization: We use Matplotlib and other visualization libraries to generate risk assessment charts, providing a visual representation of the risk levels associated with different attack scenarios.
  • Extensibility: The modular design allows for easy updates and integration of new risk factors or attack types as research in the field evolves.


The tool is grounded in a comprehensive summary review of de-anonymization attacks in healthcare data. We analyzed numerous studies and documented cases to identify common attack methods, vulnerabilities, and risk factors. This research informs the tool's risk assessment algorithms, ensuring that the evaluations are based on real-world scenarios and the latest findings in data privacy research.

By using this tool, you can gain a deeper understanding of the specific risks associated with your datasets and receive tailored recommendations to enhance data protection measures.

Experimentation


This web service will be hosted on the Innohub platform and will be accessible via any standard web browser. Users can interact with the web service, and receive comprehensive de-anonymization assessments directly through the browser, with no need for additional installations or uploads of identifiable data to external servers. In addition to being available as a standalone service in the Innohub, we will liaise with the developers of the Anonymization tool to have the risk assessment service integrated within that tool, to provide a more organic user experience.

Please Login in order to have access to the installation material.