org.apache.hadoop.mapred
Class TextInputFormat
java.lang.Object
org.apache.hadoop.mapred.FileInputFormat
org.apache.hadoop.mapred.TextInputFormat
- All Implemented Interfaces:
- InputFormat, JobConfigurable
- Direct Known Subclasses:
- KeyValueTextInputFormat
public class TextInputFormat
- extends FileInputFormat
- implements JobConfigurable
An InputFormat
for plain text files. Files are broken into lines.
Either linefeed or carriage-return are used to signal end of line. Keys are
the position in the file, and values are the line of text..
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
TextInputFormat
public TextInputFormat()
configure
public void configure(JobConf conf)
- Description copied from interface:
JobConfigurable
- Initializes a new instance from a
JobConf
.
- Specified by:
configure
in interface JobConfigurable
- Parameters:
conf
- the configuration
isSplitable
protected boolean isSplitable(FileSystem fs,
Path file)
- Description copied from class:
FileInputFormat
- Is the given filename splitable? Usually, true, but if the file is
stream compressed, it will not be.
- Overrides:
isSplitable
in class FileInputFormat
- Parameters:
fs
- the file system that the file is onfile
- the file name to check
- Returns:
- is this file splitable?
getRecordReader
public RecordReader getRecordReader(InputSplit genericSplit,
JobConf job,
Reporter reporter)
throws IOException
- Description copied from interface:
InputFormat
- Construct a
RecordReader
for a FileSplit
.
- Specified by:
getRecordReader
in interface InputFormat
- Specified by:
getRecordReader
in class FileInputFormat
- Parameters:
genericSplit
- the InputSplit
job
- the job that this split belongs to
- Returns:
- a
RecordReader
- Throws:
IOException
Copyright © 2006 The Apache Software Foundation