CS 712 STATISTICAL PROGRAMMING (Course Material)

Similar documents
STA 225: Introductory Statistics (CT)

Probability and Statistics Curriculum Pacing Guide

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Grade 6: Correlated to AGS Basic Math Skills

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Mathematics subject curriculum

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

Algebra 2- Semester 2 Review

School of Innovative Technologies and Engineering

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Numeracy Medium term plan: Summer Term Level 2C/2B Year 2 Level 2A/3C

Montana Content Standards for Mathematics Grade 3. Montana Content Standards for Mathematical Practices and Mathematics Content Adopted November 2011

CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and

Office Hours: Mon & Fri 10:00-12:00. Course Description

CS Machine Learning

Stacks Teacher notes. Activity description. Suitability. Time. AMP resources. Equipment. Key mathematical language. Key processes

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website

AP Statistics Summer Assignment 17-18

Coimisiún na Scrúduithe Stáit State Examinations Commission LEAVING CERTIFICATE 2008 MARKING SCHEME GEOGRAPHY HIGHER LEVEL

Dublin City Schools Mathematics Graded Course of Study GRADE 4

learning collegiate assessment]

GCSE Mathematics B (Linear) Mark Scheme for November Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education

Introducing the New Iowa Assessments Mathematics Levels 12 14

Grade 2: Using a Number Line to Order and Compare Numbers Place Value Horizontal Content Strand

Statewide Framework Document for:

Lesson M4. page 1 of 2

OVERVIEW OF CURRICULUM-BASED MEASUREMENT AS A GENERAL OUTCOME MEASURE

Cal s Dinner Card Deals

EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016

Math Grade 3 Assessment Anchors and Eligible Content

Iowa School District Profiles. Le Mars

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

Trends in College Pricing

Physics 270: Experimental Physics

Science Fair Project Handbook

Functional Skills Mathematics Level 2 assessment

CAAP. Content Analysis Report. Sample College. Institution Code: 9011 Institution Type: 4-Year Subgroup: none Test Date: Spring 2011

Fourth Grade. Reporting Student Progress. Libertyville School District 70. Fourth Grade

Learning Disability Functional Capacity Evaluation. Dear Doctor,

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Research Design & Analysis Made Easy! Brainstorming Worksheet

PRIMARY ASSESSMENT GRIDS FOR STAFFORDSHIRE MATHEMATICS GRIDS. Inspiring Futures

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

Missouri Mathematics Grade-Level Expectations

Global Television Manufacturing Industry : Trend, Profit, and Forecast Analysis Published September 2012

Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab. Course Website

Introduction to the Practice of Statistics

Firms and Markets Saturdays Summer I 2014

Introduction to Causal Inference. Problem Set 1. Required Problems

University of Groningen. Systemen, planning, netwerken Bosman, Aart

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

STA2023 Introduction to Statistics (Hybrid) Spring 2013

NCEO Technical Report 27

Extending Place Value with Whole Numbers to 1,000,000

Paper 2. Mathematics test. Calculator allowed. First name. Last name. School KEY STAGE TIER

Kindergarten Foundations of America

CHAPTER 4: REIMBURSEMENT STRATEGIES 24

Arizona s College and Career Ready Standards Mathematics

Lecture 1: Machine Learning Basics

GDP Falls as MBA Rises?

TRENDS IN. College Pricing

CHMB16H3 TECHNIQUES IN ANALYTICAL CHEMISTRY

TOPICS LEARNING OUTCOMES ACTIVITES ASSESSMENT Numbers and the number system

Mathematics Program Assessment Plan

Build on students informal understanding of sharing and proportionality to develop initial fraction concepts.

Hardhatting in a Geo-World

Degree Qualification Profiles Intellectual Skills

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers.

Create A City: An Urban Planning Exercise Students learn the process of planning a community, while reinforcing their writing and speaking skills.

Standard 1: Number and Computation

AGRICULTURAL AND EXTENSION EDUCATION

Mathematics process categories

Linking the Ohio State Assessments to NWEA MAP Growth Tests *

OFFICE SUPPORT SPECIALIST Technical Diploma

Timeline. Recommendations

Measurement & Analysis in the Real World

EGRHS Course Fair. Science & Math AP & IB Courses

Developing an Assessment Plan to Learn About Student Learning

Honors Mathematics. Introduction and Definition of Honors Mathematics

Probability Therefore (25) (1.33)

Contents. Foreword... 5

Corpus Linguistics (L615)

Mathematics Scoring Guide for Sample Test 2005

Math 96: Intermediate Algebra in Context

Paper Reference. Edexcel GCSE Mathematics (Linear) 1380 Paper 1 (Non-Calculator) Foundation Tier. Monday 6 June 2011 Afternoon Time: 1 hour 30 minutes

LOUISIANA HIGH SCHOOL RALLY ASSOCIATION

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Self Study Report Computer Science

(I couldn t find a Smartie Book) NEW Grade 5/6 Mathematics: (Number, Statistics and Probability) Title Smartie Mathematics

University of Waterloo School of Accountancy. AFM 102: Introductory Management Accounting. Fall Term 2004: Section 4

Mathematics Success Grade 7

Content Language Objectives (CLOs) August 2012, H. Butts & G. De Anda

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District

CHAPTER 4: RESEARCH DESIGN AND METHODOLOGY

ECON 365 fall papers GEOS 330Z fall papers HUMN 300Z fall papers PHIL 370 fall papers

Transcription:

CS 712 STATISTICAL PROGRAMMING (Course Material) ( For Private Circulation. Academic Purpose Only) For the Registered Students of CS 712,Statistical Programming to the Department of Computer Science, College of Computer Science, King Khalid University, Abha, Kingdom of Saudi Arabia Copy Rights All the rights reserved to Department of Computer Science, College of Computer Science, King Khalid University, Abha, Kingdom of Saudi Arabia.

Contents 1. INTRODUCTION TO STATISTICS... 7 1.1 Introduction... 7 1.1 Data types... 7 1.1.1 Qualitative vs. Quantitative Variables... 7 1.1.1 Discrete vs. Continuous Variables... 8 1.1.1 Univariate vs. Bivariate Data... 8 1.1 Definitions of Statistics... 9 1.1 Types of Statistics... 9 11.1 Descriptive Statistics... 9 1.1.1 Inferential Statistics... 9 1.1 Characteristics of Statistics... 11 1.1 Uses of Statistics... 11 1. APPLICATIONS OF STATISTICS... 11 1.1 Business... 11 1.1 In Economics... 11 1.1 In Mathematics... 11 1.1 In Banking... 11 1.1 In State Management (Administration)... 11 1.2. In Accounting and Auditing... 11 1.7 In Natural and Social Sciences... 11 1.8 In Astronomy... 11 1.9 Exercises... 11 1. COLLECTION OF STATISTICAL DATA... 11 1.1 Introduction on Collection of Data... 11 1.1.1 Statistical Data... 11 1.1 Types of Data:... 11 1.1.1 Primary Data:... 11 1.1.1 Secondary Data:... 11 1.1 Methods of Collecting Primary Data:... 11 1.1.1 Personal Investigation... 11

1.1.1 Through Investigation... 11 1.1.1 Collection through Questionnaire... 11 1.1 Methods of Collecting Secondary Data... 11 1.1.1 Official... 11 1.1.1 Semi-Official... 12 1.1 Difference between Primary and Secondary Data... 12 1.2 Editing of Data... 12 Exercise... 17 1. CLASSIFICATION OF DATA... 18 1.1 Introduction on Classification of Data... 18 1.1 Bases of Classification:... 18 1.1.1 Qualitative Base... 18 1.1.1 Quantitative Base... 18 1.1.1 Geographical Base... 18 1.1.1 Chronological or Temporal Base... 18 1.1 Types of Classification:... 19 1.1.1 One -way Classification:... 19 1.1.1 Two -way Classification:... 19 1.1.1 Multi -way Classification:... 19 1.1 Tabulation of Data... 19 1.1.1 Simple Tabulation or One-way Tabulation:... 19 1.1.1 Double Tabulation or Two-way Tabulation:... 11 1.1.1 Complex Tabulation:... 11 1.1 Construction of Statistical Table... 11 1.2 General Rules of Tabulation:... 11 1.7 Difference between Classification and Tabulation... 11 1.8 Exercise... 11 خطأ! اإلشارة المرجعية غير معر فة... DISTRIBUTION.1 FREQUENCY خطأ! اإلشارة المرجعية غير معر فة... Introduction 1.1 خطأ! اإلشارة المرجعية غير معر فة... Data: 1.1.1 Ungrouped Data or Raw خطأ! اإلشارة المرجعية غير معر فة... Data: 1.1.1 Grouped

خطأ! اإلشارة المرجعية غير معر فة... Array: 1.1.1 خطأ! اإلشارة المرجعية غير معر فة... Limits: 1.1 Class خطأ! اإلشارة المرجعية غير معر فة... Boundaries: 1.1.1 Class خطأ! اإلشارة المرجعية غير معر فة... Classes: 1.1.1 Open-end خطأ! اإلشارة المرجعية غير معر فة... Point 1.1.1 Class Mark or Mid خطأ! اإلشارة المرجعية غير معر فة... Interval: 1.1.1 Size of Class خطأ! اإلشارة المرجعية غير معر فة... Data 1.1 Frequency Distribution of Grouped خطأ! اإلشارة المرجعية غير معر فة... Method 1.1 Frequency Distribution by Exclusive خطأ! اإلشارة المرجعية غير معر فة... Data 1.1 Frequency Distribution of Discrete خطأ! اإلشارة المرجعية غير معر فة... Distribution 1.2 Cumulative Frequency خطأ! اإلشارة المرجعية غير معر فة... distribution: 1.1.1 Less than cumulative frequency خطأ! اإلشارة المرجعية غير معر فة... distribution: 1.1.1 More than cumulative frequency خطأ! اإلشارة المرجعية غير معر فة... Exercise 1.2 خطأ! اإلشارة المرجعية غير معر فة... DATA 2. DIAGRAMS AND GRAPHS OF STATISTICAL خطأ! اإلشارة المرجعية غير معر فة... data 2.1 Graphical representation of خطأ! اإلشارة المرجعية غير معر فة... Diagrams/Charts: 2.1.1 Types of خطأ! اإلشارة المرجعية غير معر فة... Diagrams/Charts: 2.1.1 Types of خطأ! اإلشارة المرجعية غير معر فة... Chart 2.1 Simple Bar خطأ! اإلشارة المرجعية غير معر فة... Chart 2.1 Multiple Bar خطأ! اإلشارة المرجعية غير معر فة... Chart 2.1 Pie خطأ! اإلشارة المرجعية غير معر فة... Exercise 2.1 خطأ! اإلشارة المرجعية غير معر فة... DATA 7. PARTITION OF خطأ! اإلشارة المرجعية غير معر فة... Deciles 7.1 خطأ! اإلشارة المرجعية غير معر فة... Percentiles 7.1 خطأ! اإلشارة المرجعية غير معر فة... Average 7.1 خطأ! اإلشارة المرجعية غير معر فة... Average: 7.1.1 خطأ! اإلشارة المرجعية غير معر فة... Average 7.1.1 Desirable Qualities of a Good خطأ! اإلشارة المرجعية غير معر فة... Averages: 7.1 Types of خطأ! اإلشارة المرجعية غير معر فة... Mean 7.1.1 Arithmetic خطأ! اإلشارة المرجعية غير معر فة... Mode 7.1 Concept of خطأ! اإلشارة المرجعية غير معر فة. Data:... 7.1 1 Mode from Discrete

خطأ! اإلشارة المرجعية غير معر فة... Median 7.2 خطأ! اإلشارة المرجعية غير معر فة... Data 7.2.1 Median from Ungrouped خطأ! اإلشارة المرجعية غير معر فة... Data 7.2.1 Median from Grouped خطأ! اإلشارة المرجعية غير معر فة... Data 7.2.1 Median from Discrete خطأ! اإلشارة المرجعية غير معر فة... Exercise 7.7 خطأ! اإلشارة المرجعية غير معر فة... DEVIATION.8 MEAN خطأ! اإلشارة المرجعية غير معر فة. Deviation... 8.1 The Mean خطأ! اإلشارة المرجعية غير معر فة... data 8.1 Mean Deviation for Ungrouped خطأ! اإلشارة المرجعية غير معر فة... Exercise 8.1 خطأ! اإلشارة المرجعية غير معر فة... DEVIATION 9. VARIANCE AND STANDARD خطأ! اإلشارة المرجعية غير معر فة... Variance 9.1 خطأ! اإلشارة المرجعية غير معر فة. Deviation... 9.1 Standard خطأ! اإلشارة المرجعية غير معر فة... Method 9.1.1 Actual Mean خطأ! اإلشارة المرجعية غير معر فة... Data 9.1.1 Ungrouped خطأ! اإلشارة المرجعية غير معر فة... data 9.1 Standard Deviation for Grouped خطأ! اإلشارة المرجعية غير.. data 9.1.1 Standard Deviation General procedure for Grouped معر فة. خطأ! اإلشارة المرجعية غير معر فة... Exercise 9.1 خطأ! اإلشارة المرجعية غير.. CORRELATION 11. INTRODUCTION TO REGRESSION AND معر فة. خطأ! اإلشارة المرجعية غير معر فة... Correlation 11.1 خطأ! اإلشارة المرجعية غير معر فة... Correlation 11.1.1 Positive خطأ! اإلشارة المرجعية غير معر فة... Correlation 11.1.1 Negative خطأ! اإلشارة المرجعية غير معر فة... Correlation 11.1.1 No Correlation or Zero خطأ! اإلشارة المرجعية غير معر فة... Correlation 11.1.1 Perfect خطأ! اإلشارة المرجعية غير معر فة... Correlation: 11.1.1 Perfect Positive خطأ! اإلشارة المرجعية غير معر فة... Correlation: 11.1.2 Perfect Negative خطأ! اإلشارة المرجعية غير معر فة... correlation 11.1 Coefficient of خطأ! اإلشارة المرجعية غير معر فة... Correlation 11.1.1 Positive خطأ! اإلشارة المرجعية غير معر فة... Correlation 11.1.1 Negative خطأ! اإلشارة المرجعية غير معر فة... Correlation 11.1.1 Zero

خطأ! اإلشارة المرجعية غير معر فة... Correlation 11.1.1 Moderate Positive خطأ! اإلشارة المرجعية غير معر فة... Exercise 11.1 خطأ! اإلشارة المرجعية غير معر فة... REGRESSION.11 خطأ! اإلشارة المرجعية غير معر فة... Regression 11.1 Definition خطأ! اإلشارة المرجعية غير معر فة... Regression 11.1 Linear خطأ! اإلشارة المرجعية غير معر فة... Regression 11.1 Computation of خطأ! اإلشارة المرجعية غير معر فة... Regression 11.1 Observation on خطأ! اإلشارة المرجعية غير معر فة... Exercise 11.1 خطأ! اإلشارة المرجعية غير معر فة... PROBABILITY.11 خطأ! اإلشارة المرجعية غير معر فة. Experiment... 11.1 Random خطأ! اإلشارة المرجعية غير معر فة... Space 11.1 Sample خطأ! اإلشارة المرجعية غير معر فة... probability 11.1 Event of a خطأ! اإلشارة المرجعية غير معر فة... Probability خطأ! اإلشارة المرجعية غير معر فة... Outcomes 11.1 Equally Likely خطأ! اإلشارة المرجعية غير معر فة... Outcomes: 11.1 Not Equally Likely خطأ! اإلشارة المرجعية غير معر فة... Events 11.2 Mutually Exclusive خطأ! اإلشارة المرجعية غير معر فة... Events 11.7 Not Mutually Exclusive خطأ! اإلشارة المرجعية غير معر فة... computations 11.8 Probability خطأ! اإلشارة المرجعية غير معر فة... Probability 11.9 Conditional خطأ! اإلشارة المرجعية غير معر فة... Exercise 11.11

1. INTRODUCTION TO STATISTICS 1.1 Introduction The Word statistics have been derived from Latin word Status or the Italian word Statista, meaning of these words is Political State or a Government. In the past, the statistics was used by rulers. The application of statistics was very limited but rulers and kings needed information about lands, agriculture, commerce, population of their states to assess their military potential, their wealth, taxation and other aspects of government. The word statistics refer to numerical facts and figures collected in a systematic manner with a definite purpose in any field of study. In this sense, statistics are also aggregates of facts which are expressed in numerical form. Statistics refers to the science comprising methods which are used in collection, analysis, interpretation and presentation of numerical data. These methods are used to draw conclusion about the population parameter. 1.7 Data types The data is classified into variable and constant. variable has two defining characteristics: A variable is an attribute that describes a person, place, thing, or idea. The value of the variable can "vary" from one entity to another. For example, a person's hair color is a potential variable, which could have the value of "blond" for one person and "brunette" for another. 1.7.1 Qualitative vs. Quantitative Variables Variables can be classified as qualitative (aka, categorical) or quantitative (aka, numeric). Qualitative. Qualitative variables take on values that are names or labels. The color of a ball (e.g., red, green, blue) or the breed of a dog (e.g., collie, shepherd, terrier) would be examples of qualitative or categorical variables.

Quantitative. Quantitative variables are numeric. They represent a measurable quantity. For example, when we speak of the population of a city, we are talking about the number of people in the city - a measurable attribute of the city. Therefore, population would be a quantitative variable. In algebraic equations, quantitative variables are represented by symbols (e.g., x, y, or z). 1.7.7 Discrete vs. Continuous Variables Quantitative variables can be further classified as discrete or continuous. If a variable can take on any value between its minimum value and its maximum value, it is called a continuous variable; otherwise, it is called a discrete variable. Some examples will clarify the difference between discrete and continouous variables. Suppose the fire department mandates that all fire fighters must weigh between 051 and 051 pounds. The weight of a fire fighter would be an example of a continuous variable; since a fire fighter's weight could take on any value between 051 and 051 pounds. Suppose we flip a coin and count the number of heads. The number of heads could be any integer value between 1 and plus infinity. However, it could not be any number between 1 and plus infinity. We could not, for example, get 0.2 heads. Therefore, the number of heads must be a discrete variable. 1.7.1 Univariate vs. Bivariate Data Statistical data are often classified according to the number of variables being studied. Univariate data. When we conduct a study that looks at only one variable, we say that we are working with univariate data. Suppose, for example, that we conducted a survey to estimate the average weight of high school students. Since we are only working with one variable (weight), we would be working with univariate data.

Bivariate data. When we conduct a study that examines the relationship between two variables, we are working with bivariate data. Suppose we conducted a study to see if there were a relationship between the height and weight of high school students. Since we are working with two variables (height and weight), we would be working with bivariate data. 1.7 Definitions of Statistics Statistics was defined as the science of kings, political and science of statecraft A.L. Bowley defined statistics as statistics is the science of counting A.L. Bowley has also defined as statistics is science of averages Prof: Boddington has defined statistics as science of estimate and probabilities Statistics are the numerical statement of facts capable of analysis and interpretation and the science of statistics is the study of the principles and the methods applied in collecting, presenting, analysis and interpreting the numerical data in any field of inquiry. 1.1 Types of Statistics Statistics may be divided into two main branches: (0) Descriptive Statistics (0) Inferential Statistics 11.1 Descriptive Statistics In descriptive statistics, it deals with collection of data, its presentation in various forms, such as tables, graphs and diagrams and findings averages and other measures which would describe the data. For Example: Industrial statistics, population statistics, trade statistics etc Such as businessman make to use descriptive statistics in presenting their annual reports, final accounts, bank statements. 1.1.7 Inferential Statistics In inferential statistics, it deals with techniques used for analysis of data, making the estimates and drawing conclusions from limited information taken on sample basis and testing the reliability of the estimates.

For Example: Suppose we want to have an idea about the percentage of illiterates in our country. We take a sample from the population and find the proportion of illiterates in the sample. This sample proportion with the help of probability enables us to make some inferences about the population proportion. This study belongs to inferential statistics. 1.4 Characteristics of Statistics Some of its important characteristics are given below: Statistics are aggregates of facts. Statistics are numerically expressed. Statistics are affected to a marked extent by multiplicity of causes. Statistics are enumerated or estimated according to a reasonable standard of accuracy. Statistics are collected for a predetermine purpose. Statistics are collected in a systemic manner. Statistics must be comparable to each other. 1.5 Uses of Statistics (0) Statistics helps in providing a better understanding and exact description of a phenomenon of nature. (0) Statistical helps in proper and efficient planning of a statistical inquiry in any field of study. (2) Statistical helps in collecting an appropriate quantitative data. (4) Statistics helps in presenting complex data in a suitable tabular, diagrammatic and graphic form for an easy and clear comprehension of the data. (5) Statistics helps in understanding the nature and pattern of variability of a phenomenon through quantitative observations. (6) Statistics helps in drawing valid inference, along with a measure of their reliability about the population parameters from the sample data.

7. APPLICATIONS OF STATISTICS Statistics plays a vital role in every fields of human activity. Statistics has important role in determining the existing position of per capita income, unemployment, population growth rate, housing, schooling medical facilities etc in a country. Now statistics holds a central position in almost every field like Industry, Commerce, Trade, Physics, Chemistry, Economics, Mathematics, Biology, Botany, Psychology, Astronomy etc, so application of statistics is very wide. Now we discuss some important fields in which statistics is commonly applied. 7.1 Business Statistics play an important role in business. A successful businessman must be very quick and accurate in decision making. He knows that what his customers wants, he should therefore, know what to produce and sell and in what quantities. Statistics helps businessman to plan production according to the taste of the costumers, the quality of the products can also be checked more efficiently by using statistical methods. So all the activities of the businessman based on statistical information. He can make correct decision about the location of business, marketing of the products, financial resources etc 7.7 In Economics Statistics play an important role in economics. Economics largely depends upon statistics. National income accounts are multipurpose indicators for the economists and administrators. Statistical methods are used for preparation of these accounts. In economics research statistical methods are used for collecting and analysis the data and testing hypothesis. The relationship between supply and demands is studies by statistical methods, the imports and exports, the inflation rate, the per capita income are the problems which require good knowledge of statistics. 7.1 In Mathematics Statistical plays a central role in almost all natural and social sciences. The methods of natural sciences are most reliable but conclusions draw from them are only

probable, because they are based on incomplete evidence. Statistical helps in describing these measurements more precisely. Statistics is branch of applied mathematics. The large number of statistical methods like probability averages, dispersions, estimation etc is used in mathematics and different techniques of pure mathematics like integration, differentiation and algebra are used in statistics. 7.4 In Banking Statistics play an important role in banking. The banks make use of statistics for a number of purposes. The banks work on the principle that all the people who deposit their money with the banks do not withdraw it at the same time. The bank earns profits out of these deposits by lending to others on interest. The bankers use statistical approaches based on probability to estimate the numbers of depositors and their claims for a certain day. 7.5 In State Management (Administration) Statistics is essential for a country. Different policies of the government are based on statistics. Statistical data are now widely used in taking all administrative decisions. Suppose if the government wants to revise the pay scales of employees in view of an increase in the living cost, statistical methods will be used to determine the rise in the cost of living. Preparation of federal and provincial government budgets mainly depends upon statistics because it helps in estimating the expected expenditures and revenue from different sources. So statistics are the eyes of administration of the state. 7.6. In Accounting and Auditing Accounting is impossible without exactness. But for decision making purpose, so much precision is not essential the decision may be taken on the basis of approximation, know as statistics. The correction of the values of current asserts is made on the basis of the purchasing power of money or the current value of it. In auditing sampling techniques are commonly used. An auditor determines the sample size of the book to be audited on the basis of error.

7.2 In Natural and Social Sciences Statistics plays a vital role in almost all the natural and social sciences. Statistical methods are commonly used for analyzing the experiments results, testing their significance in Biology, Physics, Chemistry, Mathematics, Meteorology, Research chambers of commerce, Sociology, Business, Public Administration, Communication and Information Technology etc 7.8 In Astronomy Astronomy is one of the oldest branch of statistical study, it deals with the measurement of distance, sizes, masses and densities of heavenly bodies by means of observations. During these measurements errors are unavoidable so most probable measurements are founded by using statistical methods. Example: This distance of moon from the earth is measured. Since old days the astronomers have been statistical methods like method of least squares for finding the movements of stars. 7.2 Exercises Choose the correct Answers: 1. is one of the oldest branch of statistical study, it deals with the measurement of distance, sizes, masses and densities of heavenly bodies by means of observations. a) Accounting b) Astronomy c) Economics 1. deals with techniques used for analysis of data, making the estimates and drawing conclusions from limited information taken on sample basis and testing the reliability of the estimates. a) Descriptive Statistics b) Bivariate data c) Inferential statistics 1. deals with representation of data b) Descriptive Statistics b) Bivariate data c) Inferential statistics Short Questions: 1. Define Univariate data. 1. List the various field of application of statistics. 2. Define Descriptive Statistics.

1. COLLECTION OF STATISTICAL DATA 1.1 Introduction on Collection of Data The first step in any enquiry (investigation) is collection of data. The data may be collected for the whole population or for a sample only. It is mostly collected on sample basis. Collection of data is very difficult job. The enumerator or investigator is the well trained person who collects the statistical data. The respondents (information) are the persons whom the information is collected. 1.1.1 Statistical Data A sequence of observation, made on a set of objects included in the sample drawn from population is known as statistical data. (0) Ungrouped Data: Data which have been arranged in a systematic order are called raw data or ungrouped data. (0) Grouped Data: Data presented in the form of frequency distribution is called grouped data. 1.7 Types of Data: There are two types (sources) for the collection of data. (0) Primary Data (0) Secondary Data 1.7.1 Primary Data: The primary data are the first hand information collected, compiled and published by organization for some purpose. They are most original data in character and have not undergone any sort of statistical treatment. Example: Population census reports are primary data because these are collected, complied and published by the population census organization.

1.7.7 Secondary Data: The secondary data are the second hand information which are already collected by someone (organization) for some purpose and are available for the present study. The secondary data are not pure in character and have undergone some treatment at least once. Example: Economics survey of England is secondary data because these are collected by more than one organization like Bureau of statistics, Board of Revenue, the Banks etc 1.1 Methods of Collecting Primary Data: Primary data are collected by the following methods: 1.1.1 Personal Investigation The researcher conducts the survey him/herself and collects data from it. The data collected in this way is usually accurate and reliable. This method of collecting data is only applicable in case of small research projects. 1.1.7 Through Investigation Trained investigators are employed to collect the data. These investigators contact the individuals and fill in questionnaire after asking the required information. Most of the organizing implied this method. 1.1.1 Collection through Questionnaire The researchers get the data from local representation or agents that are based upon their own experience. This method is quick but gives only rough estimate. Through Telephone: The researchers get information through telephone this method is quick and give accurate information. 2.4 Methods of Collecting Secondary Data The secondary data are collected by the following sources: 1.4.1 Official The publications of the Statistical Division, Ministry of Finance, the Federal Bureaus of Statistics, Ministries of Food, Agriculture, Industry, Labor etc

1.4.7 Semi-Official State Bank, Railway Board, Central Cotton Committee, Boards of Economic Enquiry etc Publication of Trade Associations, Chambers of Commerce etc Technical and Trade Journals and Newspapers. Research Organizations such as Universities and other institutions. 1.5 Difference between Primary and Secondary Data The difference between primary and secondary data is only a change of hand. The primary data are the first hand data information which is directly collected form one source. They are most original data in character and have not undergone any sort of statistical treatment while the secondary data are obtained from some other sources or agencies. They are not pure in character and have undergone some treatment at least once. For Example: Suppose we interested to find the average age of MS students. We collect the age s data by two methods; either by directly collecting from each student himself personally or getting their ages from the university record. The data collected by the direct personal investigation is called primary data and the data obtained from the university record is called secondary data. 1.6 Editing of Data After collecting the data either from primary or secondary source, the next step is its editing. Editing means the examination of collected data to discover any error and mistake before presenting it. It has to be decided before hand what degree of accuracy is wanted and what extent of errors can be tolerated in the inquiry. The editing of secondary data is simpler than that of primary data.

1.2 Exercise Choose the correct Answers: 0. The are the first hand information collected, compiled and published by organization for some purpose. a) Primary data b) Secondary data c) Ungrouped data 0. The are the second hand information which are already collected by someone (organization) for some purpose and are available for the present study. a) Primary data b) Secondary data c) Ungrouped data Short Questions: 1. What is difference between primary data and secondary data? 1. List the different collection method for primary data.

4. CLASSIFICATION OF DATA 4.1 Introduction on Classification of Data The process of arranging data into homogenous group or classes according to some common characteristics present in the data is called classification. For Example: The process of sorting letters in a post office, the letters are classified according to the cities and further arranged according to streets. 4.7 Bases of Classification: There are four important bases of classification: (0) Qualitative Base (0) Quantitative Base (2) Geographical Base (4) Chronological or Temporal Base 4.7.1 Qualitative Base When the data are classified according to some quality or attributes such as sex, religion, literacy, intelligence etc 4.0.0 Quantitative Base When the data are classified by quantitative characteristics like heights, weights, ages, income etc 4.7.1 Geographical Base When the data are classified by geographical regions or location, like states, provinces, cities, countries etc 4.7.4 Chronological or Temporal Base When the data are classified or arranged by their time of occurrence, such as years, months, weeks, days etc For Example: Time series data.

4.1 Types of Classification: 4.1.1 One -way Classification: If we classify observed data keeping in view single characteristic, this type of classification is known as one-way classification. For Example: The population of world may be classified by religion as Muslim, Christians etc 4.1.7 Two -way Classification: If we consider two characteristics at a time in order to classify the observed data then we are doing two way classifications. For Example: The population of world may be classified by Religion and Sex. 4.1.1 Multi -way Classification: We may consider more than two characteristics at a time to classify given data or observed data. In this way we deal in multi-way classification. For Example: The population of world may be classified by Religion, Sex and Literacy. 4.4 Tabulation of Data The process of placing classified data into tabular form is known as tabulation. A table is a symmetric arrangement of statistical data in rows and columns. Rows are horizontal arrangements whereas columns are vertical arrangements. It may be simple, double or complex depending upon the type of classification. 4.4.1 Simple Tabulation or One-way Tabulation: When the data are tabulated to one characteristic, it is said to be simple tabulation or one-way tabulation. For Example: Tabulation of data on population of world classified by one characteristic like Religion is example of simple tabulation. GCC Countries Saudi Arabia Oman Dubai Bahrain

4.4.7 Double Tabulation or Two-way Tabulation: The data are tabulated according to two characteristics at a time. It is said to be double tabulation or two-way tabulation. For Example: Tabulation of data on population of world classified by two characteristics like Religion and Sex is example of double tabulation. Example : students marks in different subjects Students no Subject 0 Subject 0 Subject 2 0 04 01 01 0 02 01 01 4.4.1 Complex Tabulation: When the data are tabulated according to many characteristics, it is said to be complex tabulation. For Example: Tabulation of data on population of world classified by two characteristics like Religion, Sex and Literacy etc is example of complex tabulation. 4.5 Construction of Statistical Table A statistical table has at least four major parts and some other minor parts. (0) The Title (0) The Box Head (column captions) (2) The Stub (row captions) (4) The Body (5) Prefatory Notes (6) Foots Notes (7) Source Notes

The general sketch of table indicating its necessary parts is shown below: ----THE TITLE---- ----Prefatory Notes---- ----Box Head---- ----Row Captions---- ----Column Captions---- ----Stub Entries---- ----The Body---- Foot Source Notes Notes 4.6 General Rules of Tabulation: A table should be simple and attractive. There should be no need of further explanations (details). Proper and clear headings for columns and rows should be need. Suitable approximation may be adopted and figures may be rounded off. The unit of measurement should be well defined. If the observations are large in number they can be broken into two or three tables. Thick lines should be used to separate the data under big classes and thin lines to separate the sub classes of data. 4.2 Difference between Classification and Tabulation (0) First the data are classified and then they are presented in tables, the classification and tabulation in fact goes together. So classification is the basis for tabulation. (0) Tabulation is a mechanical function of classification because in tabulation classified data are placed in row and columns.

(2) Classification is a process of statistical analysis where as tabulation is a process of presenting the data in suitable form 4.8 Exercise Choose the correct Answers: 0. The process of arranging data into homogenous group or classes according to some common characteristics present in the data is called. a) classification b) Qualitative Base c) Quantitative Base 0. The data are classified or arranged by their time of occurrence, such as years, months, weeks, days etc., it is called as a) Geographical Base b) Chronological base c) Quantitative Base 2. When the data are tabulated according to many characteristics, it is said to be. a) One way tabulation b) Two way tabulation c) Complex tabulation Short Questions: 0. List base of classification. 0. What is called classification of data and give its examples? 2. What are the major parts involved in Construction of Statistical Table? 4. What are General Rules of Tabulation? 5. What is difference between classification and tabulation? Long Questions: 0. List the types of classification and explain each type with examples. 0. How to Construction of Statistical Table and gives its general rules with examples?