What is Data Science?

Data Science is an inter-disciplinary field that incorporates various mathematical tools, algorithms, and machine learning principles in order to recognize patterns and make predictions about the future. In other words, data science can be considered the intersection between mathematics, computer science and business. The primary goal for a data scientist is to uncover hidden truths and patterns that lie within raw, unfiltered data.

What Even is Data?

Data is any piece of information collected -- both qualitative and quantitative -- that represents a particular attribute or trait. It can be as simple as your eye color or as complex as the Gross Domestic Product of the United States during a given time interval.

Data Collection -- the act of gathering data into one source -- is just one of the tasks that a data scientist has to carry out. Other include data cleaning, building predictive models, and creating powerful data visualizations.

Oftentimes, data scientists will follow the Data Science Lifecycle; this process is composed of the following steps:

The Power of Data Science

Over the past ten years, data science has skyrocketed in relevance thanks to advancements in machine learning and artificial intelligence. Today, the effects of data science can be felt across the globe -- and even above it (SpaceX, NASA)! More than that, the skills of data science can be applied to virtually every discipline, including but not limited to:

  • Transportation and Travel (Uber, Tesla)

  • Banking and E-Commerce (Fraud Protection, Predictive Analytics)

  • Healthcare and Medicine (Disease Detection and Mapping)

  • Professional Sports (Saber-metrics in MLB)

  • Entertainment (YouTube, Netflix, Spotify Recommendations)

Whether you think about it or not, we all interact with applications of data science, virtually on a daily basis. This textbook will provide you with the necessary tools to start making an impact yourself!

Last updated