Module Indicative Content |
Relational Databases and Storage
Collecting Data. Data Storage. Data Modelling. Normalisation. Indexes. Query processing and optimisation. Database Performance Evaluation.
|
SQL for Data Retrieval
Outputting Data Streams. Complex Joins/Multi-Joins. Analytic Functions. Nested and Repeated Data. Writing Efficient Queries.
|
Data Cleaning
Handling Missing Values. Scaling and Normalization. Parsing Dates. Character Encodings. Inconsistent Data Entry.
|
Visualisation and Descriptive Statistics
Univariate, bivariate and multivariate data visualisation techniques such as Bar Charts, Boxplots, Heatmaps, Scatterplots, Histograms. Density plots. Measures of centrality and spread.
|
Regression & Correlation
Pearson's and Spearman Correlation, the Coefficient of Determination, Linear Regression Analysis.
|
Statistical Inference
Inferential Statistics, Confidence interval estimates & construction of appropriate hypothesis tests.
|
DKIT reserves the right to alter the nature and timings of assessment
Module Resources
|
Recommended Book Resources |
---|
-
James, G., Witten, D., Hastie, T., Tibshirani, R.. An Introduction to Statistical Learning: with Applications in R, Springer-Verlag New York Inc., [ISBN: 9781461471370].
|
This module does not have any article/paper resources |
---|
Other Resources |
---|
-
Website, Kaggle,
-
Website, Online Statistics Education: An
Interactive Multimedia Course of Study,
-
Website, Khan Academy,
-
Website, DataCamp,
|