Saltar al contenido principal
Al continuar, acepta nuestros Términos de uso, nuestra Política de privacidad y que sus datos se almacenan en los EE. UU.

Altavoces

Más información

¿Entrenar a 2 o más personas?

Obtenga acceso de su equipo a la biblioteca completa de DataCamp, con informes centralizados, tareas, proyectos y más
Pruebe DataCamp Para EmpresasPara obtener una solución a medida, reserve una demostración.

Data Literacy in the 21st Century

November 2021
Compartir

In this webinar, Dr. Hugo Bowne-Anderson, data scientist and educator at DataCamp and host of the podcast DataFramed, will talk you through what you need to know when you talk about data. Drawing on a series of articles he is writing for Harvard Business Review, Hugo breaks the conversation down into five areas: - data generation, collection and storage, - what data looks and feels like to data scientists and analysts, - statistical intuition and common statistical pitfalls, - model building, machine learning and AI, and - the ethics of data, big and small.

You can find the slides here.

Summary

The significance of being able to understand data in the 21st century is immense. As data becomes a key part of decision-making across sectors, knowing how data science influences our lives, and the basics of artificial intelligence (AI) and machine learning (ML) are critical. The conversation starts with defining data science as an interdisciplinary field that uses scientific methods to process data in various forms. This includes structured data like spreadsheets, image data, and unstructured data such as text. The impact of data science can be seen in sectors like healthcare, where AI algorithms outperform human experts in diagnosing diseases, and agriculture, where they predict crop yields. When discussing AI, it's clarified as a broad term for systems that make intelligent decisions, separate from the more futuristic concept of artificial general intelligence (AGI). The discussion also reflects on automation, emphasizing that while tasks may be automated, entire jobs may not be. Furthermore, the implications of personal data are explored, highlighting the large amounts of personal data collected by various devices and platforms, initiating a conversation around data ownership and privacy. The webinar also looks into the statistical intuition required to understand and interpret data, pointing out common biases such as the base rate fallacy. The ethical implications of data science are emphasized, with a call for an ethical framework similar to a Hippocratic Oath for data practitioners.

Key Takeaways:

  • Data literacy is essential for everyone as data informs decisions across sectors.
  • AI and machine learning are central to modern data science, but they require a nuanced understanding.
  • Automation impacts tasks more than entire jobs, shifting the focus to task efficiency.
  • Data privacy and ownership are critical issues, necessitating informed public discourse.
  • Statistical intuition is vital for interpreting data correctly, avoiding common biases.

Deep Dives

The Essence of Data Science

Data science is defined as a discipline th ...
Leer Mas

at combines scientific methods, processes, and algorithms to extract insights from structured and unstructured data. It draws from various fields, including statistics, computer science, and domain-specific knowledge, to solve complex data-driven problems. An important aspect discussed is the interdisciplinary nature of data science, which involves different types of data such as tabular (structured), image, and unstructured data like text and HTML. As Hugo Bowne-Anderson highlights, "Data scientists are like explorers, making discoveries as they swim through the vast ocean of data." The discussion emphasizes the importance of data science in industries, particularly in handling the large amounts of data generated in the wake of the big data revolution.

AI and Machine Learning: Distinctions and Applications

AI includes systems capable of making intelligent decisions, with machine learning as a subset focused on learning from data. The webinar clarifies the distinction between AI and the more speculative AGI, which involves sentient computational beings. Current AI applications range from diagnosing diseases to analyzing legal documents, demonstrating its impact across fields. Importantly, the session addresses common misconceptions about AI, stressing that it involves specialized algorithms rather than generalized intelligence. Hugo notes, "AI is about creating systems that can make intelligent decisions, not about creating sentient beings." The conversation extends to the implications of AI in job automation, highlighting the focus on task automation rather than entire job roles.

Data Privacy and Ownership

The collection and use of personal data by devices and platforms raise significant privacy concerns. In the digital age, where the Internet of Things (IoT) connects everyday objects, the data collected becomes more personal and pervasive. The discussion urges a conversation around data ownership, asking important questions about what data is collected, by whom, and for what purpose. The General Data Protection Regulation (GDPR) is cited as a legislative example aiming to protect individuals' privacy rights. Hugo emphasizes the need for transparency and understanding of data rights, stating, "As citizens, it's essential to recognize our rights regarding the privacy and security of our data."

Statistical Intuition and Bias

An understanding of statistical intuition is important to avoid biases and misinterpretations. The base rate fallacy is highlighted as a common error, where individuals neglect base rates when interpreting statistical data. An example is given with breathalyzer tests, where despite a positive result, the probability of actual intoxication is low due to the base rate. This section emphasizes the importance of statistical literacy in making informed decisions and interpretations. Hugo suggests, "Data science and statistics help us correct these intuitive errors, enabling better decision-making and understanding."

Ethics in Data Science

Data ethics is a significant area of discussion, focusing on the fairness and transparency of algorithms. Examples include the bias in recidivism risk scores, which have significant implications for parole decisions. The webinar calls for ethical standards similar to a Hippocratic Oath for data practitioners, advocating for algorithmic audits to ensure fairness. Kathy O'Neill's work on weapons of mass destruction is referenced, highlighting the potential for algorithms to perpetuate societal biases at scale. Hugo stresses the importance of ethical considerations, noting, "Data science doesn't just predict the future; it causes it. Our responsibility is to ensure fairness and transparency in these predictive models."


Relacionado

white paper

The L&D Guide to Data Literacy

Find the appropriate distribution of data skills across your organization.

white paper

What Your Employees Must Learn to Work With Data in the 21st Century

These are the topics and skills that employees must know to work with data.

webinar

What Your Employees Must Learn to Work With Data in the 21st Century

Here's how data can impact your employees' work and what they need to know.

webinar

The State of Data Literacy in 2023

Learn about what the future holds for data skills.

webinar

Be Data Literate: The Data Literacy Skills Everyone Needs to Succeed

Jordan Morrow outlines how people and organizations can be more data literate

webinar

Building Data Fluency in an Organization

Dive into the value of data fluency in an organization and how to achieve it.

Hands-on learning experience

Companies using DataCamp achieve course completion rates 6X higher than traditional online course providers

Learn More

Upskill your teams in data science and analytics

Learn More

Join 5,000+ companies and 80% of the Fortune 1000 who use DataCamp to upskill their teams.

Don’t just take our word for it.