Data Science Project: An Inductive Learning Approach

Filipe Alves Neto Verri First Edition — v1.0.0 “Skyward Vector” — February 2026

Get it free on Leanpub →


About the Book

This book provides a structured exploration of the entire data science pipeline, from data collection to model deployment. It effectively balances theory and practice, focusing on the inductive principles underpinning predictive analytics and machine learning. Unlike texts that center on algorithms or specific tools, this book provides a holistic, language-agnostic view of every stage of a data science project.

Born from lecture notes for the graduate course PO-235 Data Science Project at ITA (Aeronautics Institute of Technology) and UNIFESP (Federal University of São Paulo), the book serves as a textbook for data science project courses and as a reference for professionals. It does not teach specific algorithms; instead, it explains why machine learning works, increasing awareness of its pitfalls and limitations.

The scope is deliberately focused on predictive and inductive methods, with deep attention to correct evaluation and validation of data science solutions. A solid mathematical and statistical foundation is expected from the reader.


Table of Contents

#ChapterDescription
1A Brief History of Data ScienceOrigins and evolution of the field
2Fundamental ConceptsCore definitions and theoretical foundations
3Data Science ProjectStructure and lifecycle of a DS project
4Structured DataRepresentations, types, and semantics of data
5Data HandlingCollection, storage, and data quality
6Learning from DataInductive learning principles and theory
7Data PreprocessingTransformation, normalization, and feature engineering
8Solution ValidationEvaluation protocols, metrics, and statistical tests
AMathematical FoundationsAppendix — Linear algebra, probability, and statistics

For Instructors: Companion Slides

Companion slide decks are available for classroom use. The slides follow the book’s structure and reproduce all original TikZ figures.

License: The slides are released under CC BY-NC 4.0 — you may adapt them for teaching purposes (more permissive than the book’s CC BY-NC-ND 4.0 license).

#Slide deckDownload
1A Brief History of Data SciencePDF
2Fundamental ConceptsPDF
3Data Science ProjectPDF
4Structured DataPDF
5Data HandlingPDF
6Data ExplorationPDF
7Learning from DataPDF
8Data PreprocessingPDF
9Solution ValidationPDF
AMathematical FoundationsPDF

LaTeX source for the slides is available on GitHub.


Citation

@book{verri2026datascienceproject,
  author    = {Verri, Filipe Alves Neto},
  title     = {Data Science Project: An Inductive Learning Approach},
  year      = 2026,
  publisher = {Leanpub},
  address   = {Victoria, British Columbia, Canada},
  doi       = {10.5281/zenodo.14498010},
  url       = {https://leanpub.com/dsp},
  note      = {Version v1.0.0}
}