
Big Data and Ethics

Big Data is not exactly a new trend, but recent advances in computing capacity have set the stage for its rise. Both the hype and the reality of these developments raise ethical issues that demand deliberation.


I came across an interesting white paper, Perspectives on Big Data, Ethics, and Society, by the Council for Big Data, Ethics, and Society. It raises concerns about the obsolescence of the Common Rule, the U.S. federal ethics rule governing research involving human subjects.

The Common Rule assumes that research using existing public datasets poses no risk to individual human subjects. However, new data science techniques can build composite pictures of people by combining datasets that are innocuous on their own but yield highly sensitive personal insights together. Because informed consent is obtained at the point of collection, before any data is used, it is not always possible to explain to subjects all the risks that current and future analytics techniques might create for their data.
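
To make the "composite picture" risk concrete, here is a minimal Python sketch of how two datasets might be linked. It is my own illustration, not something taken from the white paper: the records, names, field choices, and the `link` helper are all invented, and I am assuming that both datasets happen to share the same three fields (ZIP code, birth year, sex).

```python
from collections import defaultdict

# Hypothetical toy records: every name and value below is invented.
# Dataset A: an "anonymized" survey with a sensitive attribute but no names.
survey = [
    {"zip": "02138", "birth_year": 1961, "sex": "F", "condition": "diabetes"},
    {"zip": "02139", "birth_year": 1985, "sex": "M", "condition": "asthma"},
    {"zip": "02139", "birth_year": 1985, "sex": "M", "condition": "none"},
]

# Dataset B: a public directory with names but nothing sensitive.
directory = [
    {"name": "Alice Example", "zip": "02138", "birth_year": 1961, "sex": "F"},
    {"name": "Bob Example", "zip": "02139", "birth_year": 1985, "sex": "M"},
    {"name": "Carol Example", "zip": "02140", "birth_year": 1990, "sex": "F"},
]

# Assumption: these fields appear in both datasets.
SHARED_FIELDS = ("zip", "birth_year", "sex")


def key(record):
    """Build the join key from the fields the two datasets share."""
    return tuple(record[field] for field in SHARED_FIELDS)


def group_by_key(rows):
    """Group records by their shared-field key."""
    groups = defaultdict(list)
    for row in rows:
        groups[key(row)].append(row)
    return groups


def link(sensitive_rows, named_rows):
    """Map a name to a condition when the key is unique in both datasets."""
    sensitive = group_by_key(sensitive_rows)
    named = group_by_key(named_rows)
    linked = {}
    for shared_key, people in named.items():
        rows = sensitive.get(shared_key, [])
        if len(people) == 1 and len(rows) == 1:
            linked[people[0]["name"]] = rows[0]["condition"]
    return linked


reidentified = link(survey, directory)
print(reidentified)  # {'Alice Example': 'diabetes'}
```

In this toy case neither dataset reveals anything sensitive about a named person on its own, yet the join attaches a health condition to one name. The other two people are not linked only because their combination of fields is ambiguous or missing, which is a matter of luck and not a protection anyone designed.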

In addition, the Common Rule protects individuals, but it does not account for harms that affect communities when data is aggregated.

The Council offers the following recommendations:
  • Ensure the Common Rule clearly addresses regulation of data science. Ethics regulations should focus on what will be or could be done with datasets.
  • Seek ways to facilitate new approaches to ethics review inside academia and industry. Try new approaches that consider potential group harms in addition to individual harms.
  • Develop mechanisms of ethical assessment calibrated to the practices of big data. Expand the analysis of the ethical implications of a system throughout the entire development and usage lifecycle (which is typically different in industry and academia).
  • Create and distribute high-quality data ethics case studies that address difficulties faced by data scientists and practitioners. Case studies are a valuable pedagogical resource because they facilitate collaborative discussion.
  • Develop and support data science curricula with integrative approaches to ethics education. Ethics needs to be a cornerstone of big data education.
  • Strengthen ethics-oriented activities within professional associations. Ensure ethical commitments in research and practice at the professional association level.
  • Create hybrid spaces for ethics engagement. Treat networking and collaboration as necessary components of establishing ethics capacity.
  • Build models of internal and external ethics regulation bodies in industry. Without internal, external or legal repercussions, voluntary ethics review mechanisms could be difficult to enforce.
  • Set standards for responsible cross-sector data sharing.

In this white paper, the authors also identify some challenging questions for future work, such as how to account for the risk of sharing a dataset when we cannot know which auxiliary datasets it will be combined with in the future.
