Opportunity Through Data Textbook
  • Opportunity Through Data Textbook
  • Introduction
    • What is Data Science?
    • Introduction to Data Science: Exploratory Musical Analysis
  • Module 1
    • Introduction to Programming
      • The Command Line
      • Installing Programs
      • Python and the Command Line
      • Jupyter Notebook
    • Introduction to Python
      • Building Blocks of Python - Data Types and Variables
      • Functions
      • Formatting and Syntax
    • Math Review
      • Variables and Functions
      • Intro to Graphs
  • Module 2
    • Data Structures
      • Lists
      • Dictionaries
      • Tables
    • Programming Logic
      • Loops
      • Logical Operators
      • Conditionality
  • Module 3
    • Introduction to Probability
      • Probability and Sampling
    • Introduction to Statistics
      • Mean & Variance
      • Causality & Randomness
  • Module 4
    • Packages
    • Intro to NumPy
      • NumPy (continued)
  • Module 5
    • Introduction to Pandas
      • Introduction to Dataframes
      • Groupby and Join
    • Working with Data
    • Data Visualization
      • Matplotlib
      • Introduction to Data Visualization
  • Appendix
    • Table Utilities
    • Area of More Complicated Shapes
    • Introduction to Counting
    • Slope and Distance
    • Short Circuiting
    • Linear Regression
    • Glossary
  • Extension: Classification
    • Classification
    • Test Sets and Training Sets
    • Nearest Neighbors
  • Extension: Introduction to SQL
    • Introduction to SQL
    • Table Operations
      • Tables and Queries
      • Joins
  • Extension: Central Limit Theorem
    • Overview
    • Probability Distributions
      • Bernoulli Distribution
      • Uniform Distribution (Discrete)
      • Random Variables, Expectation, Variance
      • Discrete and Continuous Distributions
      • Uniform Distribution (Continuous)
      • Normal Distribution
    • Central Limit Theorem in Action
    • Confidence Intervals
  • Extension: Object-Oriented Programming
    • Object-Oriented Programming
      • Classes
      • Instantiation
      • Dot Notation
      • Mutability
  • Extension: Introduction to Excel
    • Introduction to Excel
      • Terminology and Interface
      • Getting Started with Analysis and Charts
      • Basics of Manipulating Data
    • Additional Features in Excel
      • Macros
      • The Data Tab
      • Pivot Tables
Powered by GitBook
On this page

Was this helpful?

  1. Module 1

Introduction to Programming

What is programming and what can it be used for?

KEY TERMS

  • computer programming: the process of creating instructions for a computer to execute

  • programming language: a computer language used by programmers to communicate with computers

  • Python: a programming language often used by data scientists to analyze and display data

Computer programming is the process of using a programming language to create instructions for a computer to execute. Like any other language, a programming language communicates information. While most languages are used to help people communicate with other people, programming languages are used to help people communicate with computers. Just as there are many different languages across the world, there are also several different types of programming languages.

Programming has many applications in the real world — some examples include commanding robots to do physical tasks or developing apps for smartphones. In this textbook, we will be covering one specific application: data science. Data scientists use programming languages to tell computers what they want them to do. For example, a data scientist could use a series of commands to tell a computer to display a graph based on a table of data, or they could tell a computer to look for mathematical patterns in the data. Computers are great at analyzing large amounts of data and can usually do so much faster than humans can; we just need to tell them how to analyze this data. We'll be teaching you all about the how throughout this textbook using Python, a programming language often used by data scientists.

PreviousIntroduction to Data Science: Exploratory Musical AnalysisNextThe Command Line

Last updated 4 years ago

Was this helpful?