Data Processing






Data processing

Data
Data are normally defined as a representation of facts, concepts and instructions during a formalized manner with which it should be suitable for communication, interpretation, or processing by human or electronic machine.

Data is represented with the help of characters like alphabets (A-Z,a-z), digits (0-9) or special characters(+,-,/,*,,= etc.).
Raw Computer Data
What is Information?

Information is processed,organised or classified data which has some meaningful values for the receiver.

Information is that the processed data on which decisions and actions are based.

For the choice to be meaningful, the processed data must qualify for the subsequent characteristics:

 Timely - Information should be available when required.
 Accuracy - Information should be accurate.
 Completeness - Information should be complete.

Data Processing Cycle

Data processing is that the re-structuring or re-ordering of knowledge by people or machine to extend their usefulness and add values for particular purpose. Data processing has three basic steps include input, processing and output. These three steps constitute the data processing cycle.
Computer Data

 Input - during this step the input file is ready in some convenient form for processing. The form will depend on the processing machine. For example, when electronic computers are used, the input file might be recorded on anybody of several sorts of input medium, like magnetic disks, tapes then on.

Processing - during this step input file is modified to supply data during a more useful form. For example, pay-checks could also be calculated from the time cards, or a summary of sales for the month could also be calculated from the sales orders.

Output - The results of the processing steps are called output. The particular sort of the output data depends on the utilization of the info. For example, output data could also be pay-checks for workers.





Data types

Data types are generally used to within type systems, which supply various ways of defining, implementing and using them. Different types of data types in systems ensure that varying degrees of type safety.
Practically all programming dialects incorporate the thought of knowledge type; They have used different terminology in different languages. Common data types include:

 * integers,
 * booleans,
 * characters,
 * floating-point numbers,
 * alphanumeric strings.

For example, within the Java programming language, the "int" type represents the set of 32-bit integers ranging in value from -2,147,483,648 to 2,147,483,647, also because the operations which will be performed on integers, like addition, subtraction, and multiplication.
Most of the programming languages are also allowed the programmer to define additional data types, usually by combining multiple elements of other types and defining the valid operations of the new data type. For example, a programmer might create a replacement data type named "complex number" that might include real and imaginary parts. A data type also represents a constraint placed upon the interpretation of knowledge during a type system; it is describing the representation, interpretation and structure of values or objects stored in memory. The type system uses data type information to see correctness of computer programs that access or manipulate the information.
Most data types in statistics have comparable types in programming, and the other way around, as shown within the following table:
Statistics Programming
real-valued (interval scale) floating-point
real-valued (ratio scale)
count data (usually non-negative) integer
binary data Boolean
categorical data enumerated type
random vector list or array
random matrix two-dimensional array
random tree tree

The definition in terms of a representation was often wiped out imperative languages like ALGOL and Pascal, while the definition in terms of a worth space and behaviour was used in higher-level languages like Simula and CLU.
Classes of data types
Primitive data types
Main article: Primitive data type
Machine data types

All data in computers supported digital electronics is represented as bits (alternatives 0 and 1) on rock bottom level. The smallest addressable unit of knowledge is typically a gaggle of bits called a byte (usually an octet, which is 8 bits). The unit processed by machine language instructions is named a word (as of 2011, typically 32 or 64 bits). Most instructions have interpreted the word as a binary number, such that a 32-bit word which can represent unsigned integer values from 0 to 232-1 or signed integer values from -231 to 231-1. Because of two's complement, the machine language and machine doesn't have to distinguish between these unsigned and signed data types for the foremost part.

There is a selected set of arithmetic instructions that use a special interpretation of the bits in word as a number.

Machine data types got to be exposed or made available in systems or low-level programming languages, allowing fine-grained control over hardware. The C programming language, as an example, supplies integer sorts of various widths, like short and long. If a corresponding native type doesn't exist on the target platform, the compiler will break them down into code using types that do exist. For instance, if a 32-bit integer is requested on a 16 bit platform, the compiler will tacitly treat it as an array of two 16 bit integers.

Most of the languages have been using binary and hexadecimal literals, for convenient manipulation of machine data.

Higher level programming languages use machine data types which are abstracted as an implementation detail that might render code less portable if exposed. For instance, a generic numeric type could be supplied rather than integers of some specific bit-width.

Boolean type

The Boolean type is represented the values of true and false. Although only two values are possible, they're rarely implemented as one digit for efficiency reasons. Many programming languages don't have a particular Boolean type, instead interpreting (for instance) 0 as false and other values as true. Boolean data simply refers to the logical structure of how the language is interpreted to the machine language.  A Boolean 0 refers to the logic False. True is usually a non zero, especially 1 is understood as Boolean 1.
Numeric types

Such as:
* The integer data types, or "whole numbers". May be sub-typed consistent with their ability to contain negative values (e.g. unsigned in C and C++). May also have alittle number of predefined subtypes (such as short and long in C/C++); or allow users to freely define subranges like 1..12 (e.g. Pascal/Ada).
 * Floating point data types are called reals, contain fractional values. They have normally predefined limits on both their maximum values and their precision. These are often represented as decimal numbers.
 * Fixed point data types are conveniently used for representing monetary values. They are often implemented internally as integers, which is resulting in predefined limits.
 * Big numbers or arbitrary precision numeric types are lack of predefined limits. They are not primitive types, and they are used sparingly for efficiency reasons.

Composite types
Main article: Composite type

Composite types are derived from quite one primitive type. This can be wiped out variety of the way . The ways they're combined are called data structures. Composing a primitive type into a compound type generally leads to a replacement type, e.g. array-of-integer is a different type to integer.

 * An array stores variety of elements of an equivalent type during a specific order. They are accessed randomly using an integer to specify which element is required (although the weather could also be of just about any type). Arrays may be fixed-length or expandable.
 A list is analogous to an array, but its contents are strung together by a series of references to subsequent element.
 * Record (also called tuple or struct) Records are among the only data structures. A record may be a value that contains other values, typically in fixed number and sequence and typically indexed by names. The elements of records are usually called fields or members.
 * Union. A union type definition will specify which of variety of permitted primitive types could also be stored in its instances, e.g. "float or long integer". Contrast with a record, which might be defined to contain a float and an integer; whereas, during a union, there's just one value at a time.
 * A tagged union (also called a variant, variant record, discriminated union, or disjoint union) contains a further field indicating its current type, for enhanced type safety.
 * a group is an abstract arrangement which will store certain values, with none particular order, and no repeated values. Values themselves aren't retrieved from sets, rather one tests a worth for membership to get a boolean "in" or "not in".
 * An object contains variety of knowledge fields, sort of a record, and also variety of subroutines for accessing or modifying them, called methods. In contrast, data structures not containing code, like those above, are called plain old data structures.
Many others are possible, but they have a tendency to be further variations and compounds of the above.

Enumerations

Main article: Enumerated type
The enumerated type. This has values which are different from one another , and which may be compared and assigned, but which don't necessarily have any particular concretism within the computer's memory; compilers and interpreters can represent them arbitrarily. For example, the four suits during a deck of playing cards could also be four enumerators named CLUB, DIAMOND, HEART, SPADE, belonging to an enumerated type named suit. If a variable A is said to be its data type, one can assign any of these four values there. Some implementations allow programmers to assign integer values to the enumeration values, or maybe treat them as type-equivalent to integers.

String and text types
Such as:
 * Alphanumeric character. A letter of the alphabet, digit, space , punctuation , etc.
 * Alphanumeric strings, a sequence of characters. They are typically wont to represent words and text.

Character and string types can store sequences of characters from a personality set like ASCII. Since most character sets include the digits, it's possible to possess a numeric string, like "1234". However, many languages would still treat these as belonging to a special type to the numeric value 1234.

Character and string types can have different subtypes consistent with the specified character "width". The original 7-bit wide ASCII was found to be limited, and superseded by 8 and 16-bit sets, which may encode a good sort of non-Latin alphabets (Hebrew, Chinese) and other symbols. Strings could also be either stretch-to-fit or of fixed size, even within the same programing language . They may also be sub typed by their maximum size.

Note: strings aren't primitive altogether languages, as an example C: they'll be composed from arrays of characters.
Other types

Types are often supported , or derived from, the essential types explained above. In some languages, such as C, functions have a kind derived from the sort of their return value.
Pointers and references

The main non-composite, derived type is that the pointer, a knowledge type whose value refers on to (or "points to") another value stored elsewhere within the memory using its address. It is a primitive kind of reference. (In everyday terms, a pagination during a book might be considered a bit of knowledge that refers to a different one). Pointers are often stored during a format almost like an integer; however, attempting to dereference or "look up" a pointer whose value was never a legitimate memory address would cause a program to crash. To ameliorate this potential problem, pointers are considered a separate type to the sort of knowledge they point to, albeit the underlying representation is that the same.


Comments

Popular Posts