Unit-2 Binary Files: Text File Binary File
Unit-2 Binary Files: Text File Binary File
Unit-2 Binary Files: Text File Binary File
Structure: 2.1 Classification Of Files 2.2 Files Modes 2.3 Standard Library Functions for Files 2.4 File Type Conversion
2.1 Classification Of Files A file is collection of data. A precise definition to files could be A file is a collection of related data stored in auxiliary storage device. This definition includes two additionally and essential attribute of file that is data in file being related and storage device. The mere representation of file in machine is in form of 0s and 1s as shown
EOF Marker
101010001101001010001010101
A proper organization is required so that program can interpret the data while reading. Based on this we have two class of files text and binary files.
101010001101001010001010101
Text File
Binary File
A text file is file of characters. I cannot contain any other data types like int or float. If present stores them in to character equivalent formats. The character her may either be encoded in ASCII or EBCDIC. On the other hand, a binary file is collection of data stored in the internal format of the computer. It can store data belonging to different data types like int, float or even structure but not another file. If the data is textual it represented by 1 bytes, numeric in two or more bytes and so on. Another major difference between the text and binary file is that in record storage. A record is collection of related data, it done in C using structure (struct). Binary file has logical sequence of records. The term binary associated may be misleading, However, In computer memory all files have binary representation. Therefore data stored as text file can also be stroed as binary file. This just depends on how we interpret the data.
2.2 Files Modes Any file has end-of-file (EOF) marker. A file can be opened in read state, write state or in error state. The error state occurs due to illegal operation done either in read or in write modes of file opening. In read state, i.e., file is in read mode, if write operation is performed leads to error state. Similarly, In write state, if read operation is performed it leads to error state. Apart from this, we have append mode where data is written at the end of file, so data is appended. Speaking with respect to our binary file, the modes are rb, wb and ab. Apart from these modes discussed we also have update mode, where we can open the file to perform any operation. There are three update modes r+b, w+b, a+b . In the r+b mode file is initially meant for reading and updating and later we can move to write state by positioning the file. Similarly the other modes.
Files in C can be opened using fopen() function and prototype is given FILE *fopen(const char *filename, const char *mode); The first parameter is the file location or path in which file is located and second parameter is the file modes. Let us summaries the file modes along with there meaning in tabular form. Modes rb Meaning Open the file for reading, Read start from beginning of the file. If the file doesnt exist then error is returned wb Open the file for writing, Write start from beginning of the file. If the file doesnt exist then it is created ab Open the file for append, New records are appended at the end If the file doesnt exist then it is created r+b Open file for both read and write. Initial read begins at start of the file. If the file doesnt exist then error is returned w+b Open file for both read and write, Initial write begins at start of the file, If the file doesnt exist then it is created a+b Open file for both read and write, Write starts at the end, If the file doesnt exist then it is created
2.3 Standard Library Functions for Files Standard library function is the built-in function provided by C for file handling. We discuss this function with respect to our binary file and set of function can be categorized as File open/close Character Input/Output Formatted Input/Output Line Input/Output
Block Input/Output File positioning File status System file operations Opening and closing a file are basic operation performed on file fro which we have fopen() and fclose() function in library. The prototype and it usage in given below FILE *fopen(const char *filename, const char *mode); int fclose(FILE *fp);
Usage:
fp = fopen(abc.bin,rb); fclose(fp);
C make use of block Input/Outputfuction to read an write in to binary files. This is done by fread() and fwrite() function from the library. The prototype and usage is given below. int fread(void *inarea, int item size, int count,FILE *fp); int fwrite(void *inarea, int item size, int count,FILE *fp);
The first parameter is generic pointer to in area or input area from the memory which is usally a structure. The next two parameters are size of each item and number of items. Lastly, the file pointer. Usage: amt_ read = fread(&e,sizeof(e),1,fp); Where e is some structure. C provides three function to determine the file status-test end of file feof(), testerror ferror() and clear error clearerr(). feof() is used to check if the end of file has been reached, if so it returns a true else return false. The prototype of the function is int feof(FILE *fp) Test error is used to check the error status of the file occurred due to various reasons. If error has occurred it return true else false. The prototype is int ferror(FILE *fp) The error status can be reset by using clearer() function and prototype is void clearerr(FILE *fp) The randomness of a file can be seen through the file positioning techniques, this can be don in C using ftell() tell current pointer location, fseek()- moves the pointer to specified position, rewind() moves the file pointer to the beginning of the file. The prototype for functions is given below. long int ftell(FILE *stream); void rewind(FILE *stream); int fseek(FILE *stream, long offset, int wherefrom);
The offset in the fseek() function specified the number of bytes to be moved. The wherefrom indicates in which direction. There are three named constants provided by C they are: SEEK_SET SEEK_CUR SEEK_END 0 1 2
The SEEK_CUR start the displacement from the current position to number of bytes specified. SEEK_END is from the end of the file
Apart from these library file C also provides functions to remove(),rename() and tmpfile(). The prototype is given below. int remove(FILE *stream);
It returns 0 if successfully deleted else returns a non-zero value. int rename(const char *oldfilename, const char *newfilename);
tmpfile() is used to create temporary output file. This can be done like this.
Somewhat of trivial problem is file type conversion i.e.., either from text to binary or vice versa. C does not provide a standard library function to implement this. We describe logic in this section.
We create a binary file from text file by reading a text file either fgets() and separated the input data by using sscanf(). Later put the data to the binary file. The process is repeated until all input from the text file is read. The program for it shown
#include<stdio.h>
char *t2bconv(FILE *txt, EMP *e, FILE *bin) { char *io, buffer[100];
io = fgets(buffer, sizeof(buffer), txt); if(io) { sscanf(buffer,%d%s,e->ssn,e->name); amt =fwrite(&e, sizeof(e), 1, bin); } if(amt!=1) { printf(cantwrite); exit(0); } return io;
if(!(txt = fopen(stud.txt,r)) { printf(\n Cannot open the text fil\n); exit(0); } if(!(bin = fopen(stud.bin,wb)) { printf(\n Cannot open the text fil\n); exit(0); }
while((t2bconv(txt,&e,bin));
fclose(txt); fclose(bin); }
Programmer can also conert from binary to text file. By reading the data from the binary and placing them to text file in their character equivalent form or in ASCII format.
#include <stdio.h>
void writereport(EMP e, FILE *bin) { static int lcount = 50+1; char buffer[100];
if(!(bin = fopen(stud.bin,rb)) { printf(\n Cannot open the bin fil\n); exit(0); } if(!(txt = fopen(stud.txt,w)) { printf(\n Cannot open the text fil\n); exit(0); } e = getdata(txt);
fclose(txt); fclose(bin); }
BIBLIOGRAPHY
1 . A structured programming approach by Belrouz Furuzon 2. Let us C by Yashwant kanetkar 3. Systematic approach to data structures using C by A M Padma Reddy