Working with Network Data

Authors

Why this book?

This book focuses on the practical side of network science — working with network data — to offer a useful guide for data scientists to use network science.

We hope that this book can help researchers in day-to-day tasks, starting from the very act of conceptualizing networks through to sophisticated network analysis, from exploratory analysis to statistical modeling and machine learning. At the same time, we also aim to give data scientists a foundational understanding of the tools, both mathematical and computational, at their disposal. The breadth and depth of statistical methods we can now use on network data is dizzying. We wish to take the prepared data scientist from their base knowledge of mathematics and statistics forward on a journey through the fundamentals of network data.

Table of Contents

Click here to see the Table of Contents

I. Background

  1. A whirlwind tour of network science
  2. Network data across fields
  3. Data ethics
  4. Primer

II. Applications, tools, and tasks

  1. The life cycle of a network study
  2. Gathering data
  3. Extracting networks from data — the “upstream task”
  4. Implementation: storing and manipulating network data
  5. Incorporating node and edge attributes
  6. Awful errors and how to amend them
  7. Explore and explain: statistics for network data
  8. Understanding network structure and organization
  9. Visualizing networks
  10. Summarizing and comparing networks
  11. Dynamics and dynamic networks
  12. Machine learning

Interlude — good practices for scientific computing

  1. Research record-keeping
  2. Data provenance
  3. Reproducible and reliable code
  4. Helpful tools

III. Fundamentals

  1. Networks demand network thinking: the friendship paradox
  2. Network models
  3. Statistical models and inference
  4. Uncertainty quantification and error analysis
  5. Ghost in the matrix: spectral methods for networks
  6. Embedding and machine learning
  7. Big data and scalability

Resources

Datasets: Focal networks

Errata