Tribhuvan University

Institute of Science and Technology

2078

Bachelor Level / third-semester / Science

Computer Science and Information Technology( STA215 )

Statistics II

Full Marks: 60 + 20 + 20

Pass Marks: 24 + 8 + 8

Time: 3 Hours

Candidates are required to give their answers in their own words as far as practicable.

The figures in the margin indicate full marks.

Group A

Attempt any TWO questions

1

There are three brands of computers namely Dell, Lenovo, and HP. The following are the lifetime of 15 computers in years.

Serial NumberComputer BrandLifetime in years
1Dell15
2Lenovo10
3HP9
4Dell12
5Lenovo6
6HP7
7Dell4
8Lenovo8
9HP13
10Dell11
11HP5
12Lenovo7
13Dell3
14HP5
15Lenovo4

Apply appropriate statistical tests to identify whether the average lifetime (in years) is significantly different across three rands of computers at a 5% level of significance. You can again tabulate the data initially in the required format for statistical analysis.

2

Explain the sample distribution of mean with reference to some numerical example. Illustrate the practical implications of the Central Limit Theorem (CLT) in inferential statistics.

3

A study was conducted among IT officers working in different IT Centers in Kathmandu valley. One of the objectives of the study was to quantify the effect of age and working hours per day on Computer Vision Syndrome (CVS). The CVS was measured in a continuum measurement scale varying from 0 to 50. A few parts of the data were taken randomly from the surveyed data and provided in the following table for the statistical analysis.

Respondent’s ID00100712523199299145
Scales of CVS67511329.028
Age of respondents (in years)24263041475052
Working hour(per day)4568367

Recognize which one is the dependent variable. Assuming that the relationship between CVS, age, and working hours is linear. Fit a multiple linear regression model to address the objective of the study and interpret the model appropriately.

Group B

Attempt any EIGHT questions

4

The following are the details of working hours in the classroom per week of male and female faculty working in the area of Computer Science and Information Technology at Tribhuvan University.

Male FacultyFemale Faculty
Sample Size6030
Average working hours per week129
The standard deviation of a working hour per week43

Apply independent t-test to examine the average working hour in the classroom per week is significantly different between male and female faculty, at 1% level of significance. State also null and alternative hypotheses appropriately.

5

A survey was conducted among 70 students studying B.Sc. CSIT in some colleges randomly. Among them, 50 students secured more than 80% marks in statistics. Compute 99% and 95% confidence intervals for the population proportion of students who secured more than 80% marks in subject statistics, and comment on the results.

6

In location 1, there are 250 corona-positive cases out of 460 persons, and in location 2, 250 positive cases were reported out of 650 persons. Can it be concluded that the proportion of corona-positive cases is higher in location 1 compared to location 2? Test at a 10% level of significance.

7

Previous literature has reported that the average age of Bsc.CSIT enrolling students in Tribhuvan University is 22 years. A researcher has doubts about this information and he feels that the average age is less than 22 years. In order ti examine this, the following sample data were collected randomly from the enrolling students of CSIT.

Age in years20192223192021201920

Set up null and alternative hypotheses and test whether the researcher’s doubt will be justified. Use 5% level of significance. Assume that the parent population from which samples are drawn is normally distributed.

8

Apply the Mann-Whitney U test for examining the following knowledge score on IT among two groups of IT workers at a 5% level of significance.

Group A:58276
Group B:91246
9

A survey was conduct to see the association between hacking status of the email and the type of email account. The survey has reported the following cross tabulation.

Type of e-mail accountHacking status
YesNo
Yahoo6015
 Gmail20120

Do the information provide sufficient evidence to conclude that the type email account and the hacking status is associated? Use Chi-square test at 1% level of significance.

10

State the mathematical model for Statistical analysis for m x m LSD for one observation per experimental unit. Also prepare a dummy ANOVA table for this.

11

Define the Markov chain and introduce its basic notations. Also, explain the characteristics of a Markov chain.

12

Write short notes on the following:

  1. The rationale of using the non-parametric statistical test
  2. Estimation of minimum size for  the given proportion