Skip to main content

Data AnalyticsLaajuus (5 cr)

Code: YTIP2200

Credits

5 op

Teaching language

  • English

Responsible person

  • Pekka Varis

Objective

The student understands the significance of data analytics in the digitalizing operational environment. The student knows the most commonly used methods and theories of data analytics as well as how to apply them in practice to existing data and interpret the results of the methods.

Course competences
EUYEN EUR-ACE: Engineering Analysis, Master's Degree
EUYEE EUR-ACE: Engineering Design, Master's Degree
EUYIV EUR-ACE: Investigations, Master's Degree

Content

- Python data analytics libraries: NumPy, Pandas, Matplotlib, Seaborn, Scipy
- Data visualization
- Processing of missing values and outliers
- Statistical terms: Average, standard deviation, correlation coefficient and their interpretations
- The concept of probability distribution, confidence interval and hypothesis testing.
- Bernoulli and Poisson processes
- Linear/logistic regression, decision trees

Assessment criteria, satisfactory (1)

**Assessment criteria, sufficient 1, satisfactory 2
Sufficient 1:

The student knows about the most commonly used techniques in data analytics in data analysis tasks. He/she is able to apply the most common techniques to analysing data and has sufficient knowledge of the mathematics behind the techniques. Additionally, the student is able to assess his/her implementation briefly.

Satisfactory 2:
The student knows the most commonly used techniques in data analytics in data analysis tasks. He/she is able to select the techniques for analysing data and apply his/her technical know-how in practice. Student understands the mathematics behind the techniques at a satisfying level. Additionally, the student is able to assess his/her implementation superficially.

Assessment criteria, good (3)

Good 3:
The student is aware of the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques of data analytics in various data analysis tasks. Student understands well the mathematics behind the techniques at a good level. He/she is able to validate and select the techniques in data analysis and apply his/her technical know-how in practice. Additionally, the student is able to assess his/her implementation and validate its development.

Very good 4:
The student recognizes the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques of data analytics and is able to extensively validate the use of implemented techniques in various data analysis tasks. Student understands the mathematics behind the techniques at a very good level. He/she is able to versatilely validate and select the correct techniques for the analysis of data and apply his/her technical know-how to practice. Additionally, the student is able to assess his/her implementation profoundly and validate its development.

Assessment criteria, excellent (5)

Excellent 5:
The student recognizes the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques in data analytics and is able to critically validate the use of implemented techniques in various data analysis tasks. Student understands the mathematics behind the techniques in excellent level. He/she is able to critically validate and select the correct techniques in data analysis regardless of the data to be analyzed and apply the technical know-how to practice. Additionally, the student is able to critically assess his/her implementation and validate its development.

Enrollment

01.08.2024 - 31.08.2024

Timing

02.09.2024 - 18.12.2024

Number of ECTS credits allocated

5 op

Mode of delivery

Face-to-face

Unit

School of Technology

Teaching languages
  • English
Seats

0 - 35

Degree programmes
  • Master's Degree Programme in Artificial Intelligence and Data Analytics
Teachers
  • Harri Varpanen
Groups
  • YTI24S1
    Master's Degree Programme in Artificial Intelligence and Data-analytics

Objectives

The student understands the significance of data analytics in the digitalizing operational environment. The student knows the most commonly used methods and theories of data analytics as well as how to apply them in practice to existing data and interpret the results of the methods.

Course competences
EUYEN EUR-ACE: Engineering Analysis, Master's Degree
EUYEE EUR-ACE: Engineering Design, Master's Degree
EUYIV EUR-ACE: Investigations, Master's Degree

Content

- Python data analytics libraries: NumPy, Pandas, Matplotlib, Seaborn, Scipy
- Data visualization
- Processing of missing values and outliers
- Statistical terms: Average, standard deviation, correlation coefficient and their interpretations
- The concept of probability distribution, confidence interval and hypothesis testing.
- Bernoulli and Poisson processes
- Linear/logistic regression, decision trees

Learning materials and recommended literature

Online material.

Teaching methods

Online lectures (recorded), bi-weekly exercises with peer review.
One mandatory weekend campus meeting face-to-face: Fri Oct 25 - Sat Oct 26, 2024.

Exam dates and retake possibilities

No exams.

Alternative completion methods

Uni. Helsinki: Data Analysis with Python (MOOC), course code CSM90004

Content scheduling

Seven sets of theory / exercises:
1. Orientation (basics, numpy, matrices)
2. Data manipulation (pandas)
3. Data visualization (matplotlib & seaborn)
4. Time series (pmdarima)
5. Linear regression (scipy / sklearn)
6. Logistic regression (scipy / sklearn)
7. Dimension reduction (scipy / sklearn).

Further information for students

The grade is determined by the number of completed exercises.
Basic knowledge of python is assumed.

Evaluation scale

0-5

Evaluation criteria, satisfactory (1-2)

**Assessment criteria, sufficient 1, satisfactory 2
Sufficient 1:

The student knows about the most commonly used techniques in data analytics in data analysis tasks. He/she is able to apply the most common techniques to analysing data and has sufficient knowledge of the mathematics behind the techniques. Additionally, the student is able to assess his/her implementation briefly.

Satisfactory 2:
The student knows the most commonly used techniques in data analytics in data analysis tasks. He/she is able to select the techniques for analysing data and apply his/her technical know-how in practice. Student understands the mathematics behind the techniques at a satisfying level. Additionally, the student is able to assess his/her implementation superficially.

Evaluation criteria, good (3-4)

Good 3:
The student is aware of the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques of data analytics in various data analysis tasks. Student understands well the mathematics behind the techniques at a good level. He/she is able to validate and select the techniques in data analysis and apply his/her technical know-how in practice. Additionally, the student is able to assess his/her implementation and validate its development.

Very good 4:
The student recognizes the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques of data analytics and is able to extensively validate the use of implemented techniques in various data analysis tasks. Student understands the mathematics behind the techniques at a very good level. He/she is able to versatilely validate and select the correct techniques for the analysis of data and apply his/her technical know-how to practice. Additionally, the student is able to assess his/her implementation profoundly and validate its development.

Evaluation criteria, excellent (5)

Excellent 5:
The student recognizes the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques in data analytics and is able to critically validate the use of implemented techniques in various data analysis tasks. Student understands the mathematics behind the techniques in excellent level. He/she is able to critically validate and select the correct techniques in data analysis regardless of the data to be analyzed and apply the technical know-how to practice. Additionally, the student is able to critically assess his/her implementation and validate its development.

Enrollment

20.11.2023 - 04.01.2024

Timing

08.01.2024 - 30.04.2024

Number of ECTS credits allocated

5 op

Mode of delivery

Face-to-face

Unit

School of Technology

Teaching languages
  • English
Seats

20 - 35

Degree programmes
  • Master's Degree Programme in Artificial Intelligence and Data Analytics
Teachers
  • Harri Varpanen
Groups
  • YTI23S1
    Master's Degree Programme in Artificial Intelligence and Data-analytics

Objectives

The student understands the significance of data analytics in the digitalizing operational environment. The student knows the most commonly used methods and theories of data analytics as well as how to apply them in practice to existing data and interpret the results of the methods.

Course competences
EUYEN EUR-ACE: Engineering Analysis, Master's Degree
EUYEE EUR-ACE: Engineering Design, Master's Degree
EUYIV EUR-ACE: Investigations, Master's Degree

Content

- Python data analytics libraries: NumPy, Pandas, Matplotlib, Seaborn, Scipy
- Data visualization
- Processing of missing values and outliers
- Statistical terms: Average, standard deviation, correlation coefficient and their interpretations
- The concept of probability distribution, confidence interval and hypothesis testing.
- Bernoulli and Poisson processes
- Linear/logistic regression, decision trees

Learning materials and recommended literature

Online material.

Teaching methods

Online lectures (recorded) and exercise sessions.
Two mandatory campus meetings (Fri-Sat): Jan 12-13 and Mar 15-16, 2024

Exam dates and retake possibilities

No exams.

Alternative completion methods

Uni. Helsinki: Data Analysis with Python (MOOC), course code CSM90004

Content scheduling

Seven sets of theory / exercises:
1. Orientation (basics, numpy, matrices)
2. Data manipulation (pandas)
3. Data visualization (matplotlib & seaborn)
4. Time series (pmdarima)
5. Linear regression (scipy / sklearn)
6. Logistic regression (scipy / sklearn)
7. Dimension reduction (scipy / sklearn).

Further information for students

The grade is determined by the number of completed exercises.
Basic knowledge of python is assumed.

Evaluation scale

0-5

Evaluation criteria, satisfactory (1-2)

**Assessment criteria, sufficient 1, satisfactory 2
Sufficient 1:

The student knows about the most commonly used techniques in data analytics in data analysis tasks. He/she is able to apply the most common techniques to analysing data and has sufficient knowledge of the mathematics behind the techniques. Additionally, the student is able to assess his/her implementation briefly.

Satisfactory 2:
The student knows the most commonly used techniques in data analytics in data analysis tasks. He/she is able to select the techniques for analysing data and apply his/her technical know-how in practice. Student understands the mathematics behind the techniques at a satisfying level. Additionally, the student is able to assess his/her implementation superficially.

Evaluation criteria, good (3-4)

Good 3:
The student is aware of the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques of data analytics in various data analysis tasks. Student understands well the mathematics behind the techniques at a good level. He/she is able to validate and select the techniques in data analysis and apply his/her technical know-how in practice. Additionally, the student is able to assess his/her implementation and validate its development.

Very good 4:
The student recognizes the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques of data analytics and is able to extensively validate the use of implemented techniques in various data analysis tasks. Student understands the mathematics behind the techniques at a very good level. He/she is able to versatilely validate and select the correct techniques for the analysis of data and apply his/her technical know-how to practice. Additionally, the student is able to assess his/her implementation profoundly and validate its development.

Evaluation criteria, excellent (5)

Excellent 5:
The student recognizes the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques in data analytics and is able to critically validate the use of implemented techniques in various data analysis tasks. Student understands the mathematics behind the techniques in excellent level. He/she is able to critically validate and select the correct techniques in data analysis regardless of the data to be analyzed and apply the technical know-how to practice. Additionally, the student is able to critically assess his/her implementation and validate its development.

Enrollment

01.11.2022 - 05.01.2023

Timing

09.01.2023 - 28.04.2023

Number of ECTS credits allocated

5 op

Virtual portion

4 op

Mode of delivery

20 % Face-to-face, 80 % Online learning

Unit

School of Technology

Campus

Lutakko Campus

Teaching languages
  • English
Seats

0 - 35

Degree programmes
  • Master's Degree Programme in Artificial Intelligence and Data Analytics
Teachers
  • Harri Varpanen
Groups
  • YTI22S1
    Master's Degree Programme in Artificial Intelligence and Data-analytics

Objectives

The student understands the significance of data analytics in the digitalizing operational environment. The student knows the most commonly used methods and theories of data analytics as well as how to apply them in practice to existing data and interpret the results of the methods.

Course competences
EUYEN EUR-ACE: Engineering Analysis, Master's Degree
EUYEE EUR-ACE: Engineering Design, Master's Degree
EUYIV EUR-ACE: Investigations, Master's Degree

Content

- Python data analytics libraries: NumPy, Pandas, Matplotlib, Seaborn, Scipy
- Data visualization
- Processing of missing values and outliers
- Statistical terms: Average, standard deviation, correlation coefficient and their interpretations
- The concept of probability distribution, confidence interval and hypothesis testing.
- Bernoulli and Poisson processes
- Linear/logistic regression, decision trees

Learning materials and recommended literature

Online material.

Teaching methods

Online lectures (recorded) and exercise sessions.
Two campus meetings (Fri-Sat): Jan 13-14 and Mar 10-11, 2023

Exam dates and retake possibilities

No exams.

Alternative completion methods

Uni. Helsinki: Data Analysis with Python (MOOC), course code CSM90004

Content scheduling

Seven sets of theory / exercises:
1. Orientation (basics, numpy, matrices)
2. Data manipulation (pandas)
3. Data visualization (matplotlib & seaborn)
4. Time series (pmdarima)
5. Linear regression (scipy / sklearn)
6. Logistic regression (scipy / sklearn)
7. Dimension reduction (scipy / sklearn).

Further information for students

The grade is determined by the number of completed exercises.
Basic knowledge of python is assumed.

Evaluation scale

0-5

Evaluation criteria, satisfactory (1-2)

**Assessment criteria, sufficient 1, satisfactory 2
Sufficient 1:

The student knows about the most commonly used techniques in data analytics in data analysis tasks. He/she is able to apply the most common techniques to analysing data and has sufficient knowledge of the mathematics behind the techniques. Additionally, the student is able to assess his/her implementation briefly.

Satisfactory 2:
The student knows the most commonly used techniques in data analytics in data analysis tasks. He/she is able to select the techniques for analysing data and apply his/her technical know-how in practice. Student understands the mathematics behind the techniques at a satisfying level. Additionally, the student is able to assess his/her implementation superficially.

Evaluation criteria, good (3-4)

Good 3:
The student is aware of the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques of data analytics in various data analysis tasks. Student understands well the mathematics behind the techniques at a good level. He/she is able to validate and select the techniques in data analysis and apply his/her technical know-how in practice. Additionally, the student is able to assess his/her implementation and validate its development.

Very good 4:
The student recognizes the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques of data analytics and is able to extensively validate the use of implemented techniques in various data analysis tasks. Student understands the mathematics behind the techniques at a very good level. He/she is able to versatilely validate and select the correct techniques for the analysis of data and apply his/her technical know-how to practice. Additionally, the student is able to assess his/her implementation profoundly and validate its development.

Evaluation criteria, excellent (5)

Excellent 5:
The student recognizes the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques in data analytics and is able to critically validate the use of implemented techniques in various data analysis tasks. Student understands the mathematics behind the techniques in excellent level. He/she is able to critically validate and select the correct techniques in data analysis regardless of the data to be analyzed and apply the technical know-how to practice. Additionally, the student is able to critically assess his/her implementation and validate its development.

Enrollment

01.11.2021 - 09.01.2022

Timing

10.01.2022 - 29.04.2022

Number of ECTS credits allocated

5 op

Virtual portion

4 op

Mode of delivery

20 % Face-to-face, 80 % Online learning

Unit

School of Technology

Campus

Lutakko Campus

Teaching languages
  • English
Seats

0 - 35

Degree programmes
  • Master's Degree Programme in Artificial Intelligence and Data Analytics
Teachers
  • Tuula Kotikoski
  • Harri Varpanen
Teacher in charge

Mika Rantonen

Groups
  • YTI21S1
    Master's Degree Programme in Artificial Intelligence and Data-analytics
  • ZJA21STIPYIA
    Avoin AMK, tekniikka, ICT, Artificial Intelligence and Data-analytics

Objectives

The student understands the significance of data analytics in the digitalizing operational environment. The student knows the most commonly used methods and theories of data analytics as well as how to apply them in practice to existing data and interpret the results of the methods.

Course competences
EUYEN EUR-ACE: Engineering Analysis, Master's Degree
EUYEE EUR-ACE: Engineering Design, Master's Degree
EUYIV EUR-ACE: Investigations, Master's Degree

Content

- Python data analytics libraries: NumPy, Pandas, Matplotlib, Seaborn, Scipy
- Data visualization
- Processing of missing values and outliers
- Statistical terms: Average, standard deviation, correlation coefficient and their interpretations
- The concept of probability distribution, confidence interval and hypothesis testing.
- Bernoulli and Poisson processes
- Linear/logistic regression, decision trees

Learning materials and recommended literature

Joel Grus: Data Science from Scratch: First Principles with Python

Teaching methods

5 ECTS credits equals about 135 hours of study work.
Contact instruction: 36 hours
Exercises/Study project: 99 hours

Practical training and working life connections

It is advisable that the students use their own work environment as an object of study and as a data source.

Possible guest lecturers from companies.

Exam dates and retake possibilities

The course does not have an exam. The assessment is based on the evaluation assignments.

Content scheduling

There will be one or few contact sessions on saturdays at 9am - 3pm. The exact times are announced by the end of the year 2021. The place will be JAMK / Lutakko campus, Piippukatu, Jyväskylä.

5 ECTS credits equals about 135 hours of study work.
Contact instruction: 36 hours
Exercises/Study project: 99 hours

Further information for students

Avoin amk 5

Evaluation scale

0-5

Evaluation criteria, satisfactory (1-2)

**Assessment criteria, sufficient 1, satisfactory 2
Sufficient 1:

The student knows about the most commonly used techniques in data analytics in data analysis tasks. He/she is able to apply the most common techniques to analysing data and has sufficient knowledge of the mathematics behind the techniques. Additionally, the student is able to assess his/her implementation briefly.

Satisfactory 2:
The student knows the most commonly used techniques in data analytics in data analysis tasks. He/she is able to select the techniques for analysing data and apply his/her technical know-how in practice. Student understands the mathematics behind the techniques at a satisfying level. Additionally, the student is able to assess his/her implementation superficially.

Evaluation criteria, good (3-4)

Good 3:
The student is aware of the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques of data analytics in various data analysis tasks. Student understands well the mathematics behind the techniques at a good level. He/she is able to validate and select the techniques in data analysis and apply his/her technical know-how in practice. Additionally, the student is able to assess his/her implementation and validate its development.

Very good 4:
The student recognizes the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques of data analytics and is able to extensively validate the use of implemented techniques in various data analysis tasks. Student understands the mathematics behind the techniques at a very good level. He/she is able to versatilely validate and select the correct techniques for the analysis of data and apply his/her technical know-how to practice. Additionally, the student is able to assess his/her implementation profoundly and validate its development.

Evaluation criteria, excellent (5)

Excellent 5:
The student recognizes the advantages of data analytics in the era of digitalization. The student knows the most commonly used techniques in data analytics and is able to critically validate the use of implemented techniques in various data analysis tasks. Student understands the mathematics behind the techniques in excellent level. He/she is able to critically validate and select the correct techniques in data analysis regardless of the data to be analyzed and apply the technical know-how to practice. Additionally, the student is able to critically assess his/her implementation and validate its development.