CIVILICA We Respect the Science
(Specialized publisher of national conferences / Publishing license number from the Ministry of Culture and Islamic Guidance: 8971)

A Novel Framework of Anonymization Techniques for Big Data Applications in Interactive Information Retrieval Systems

Paper title: A Novel Framework of Anonymization Techniques for Big Data Applications in Interactive Information Retrieval Systems
National paper ID: IIIRC02_001
Published in the Second National Conference on Interactive Information Retrieval, 1398 (Solar Hijri calendar)
Authors:

Mehdi Hasaninasab - ICT Research Institute (ITRC), Information Technology Institute, ITRC, Tehran, Iran
Morteza Sargolzaei Javan - ICT Research Institute (ITRC), Information Technology Institute, ITRC, Tehran, Iran
Ehsan Arianyan - ICT Research Institute (ITRC), Information Technology Institute, ITRC, Tehran, Iran

Abstract:
In today's digital world, organizations constantly transmit and receive data in different formats, at different rates, and through different technologies. Organizations recognize the opportunities and business value offered by Big Data technologies. However, many problems arise from the difficulty of understanding the complex dimensions involved in Big Data adoption. Open data is one of the most important topics in the big data domain. Its benefit to the development of interactive information retrieval systems, and consequently to the overall economic growth of countries, is undeniable. Before release, data should be anonymized (e.g., by removing the data owner's identity) to avoid privacy violations. Various anonymization techniques are available, which differ in their algorithms, speed, scalability, and de-identification risk. Choosing the proper technique for each big data application is critical to reaching the desired efficiency. This paper surveys the most important anonymization techniques and groups them into two main categories: randomization and generalization. Moreover, it proposes a novel framework for choosing the best anonymization technique for six application categories: health, general data, finance, geographic information, social networks, and image data. Using this framework helps designers anonymize data efficiently before releasing it.
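The abstract names two families of techniques, generalization and randomization, without specifying algorithms. The following is only a minimal illustrative sketch of the general ideas, not the paper's framework: all function names, field names, bucket sizes, and the noise scale are hypothetical choices made for this example.

```python
import random


def generalize_age(age, bucket=10):
    """Generalization: replace an exact age with a coarser range."""
    low = (age // bucket) * bucket
    return f"{low}-{low + bucket - 1}"


def generalize_zip(zip_code, keep=3):
    """Generalization: keep a prefix of the code and mask the rest."""
    return zip_code[:keep] + "*" * (len(zip_code) - keep)


def randomize_value(value, scale, rng):
    """Randomization: perturb a numeric value with bounded uniform noise."""
    return value + rng.uniform(-scale, scale)


def anonymize(record, rng):
    """Drop the direct identifier ("name"), generalize the
    quasi-identifiers, and perturb the numeric attribute."""
    return {
        "age": generalize_age(record["age"]),
        "zip": generalize_zip(record["zip"]),
        "income": round(randomize_value(record["income"], 1000.0, rng), 2),
    }


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the sketch is repeatable
    record = {"name": "Alice", "age": 37, "zip": "14155", "income": 52000.0}
    print(anonymize(record, rng))
```

In this sketch, generalization trades precision for lower identifiability (an age of 37 becomes the range 30-39), while randomization keeps the attribute's granularity but distorts individual values. Which trade-off suits a given application category is the kind of choice the proposed framework is meant to guide.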

Keywords:
Anonymization, Privacy, GDPR, Big data, Retrieval.

Paper page and full-text download: https://civilica.com/doc/952672/