|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
public interface InputFormat
An input data format. Input files are stored in a FileSystem.
The processing of an input file may be split across multiple machines.
Files are processed as sequences of records, implementing RecordReader. Files must thus be split on record boundaries.
| Method Summary | |
|---|---|
RecordReader |
getRecordReader(InputSplit split,
JobConf job,
Reporter reporter)
Construct a RecordReader for a FileSplit. |
InputSplit[] |
getSplits(JobConf job,
int numSplits)
Splits a set of input files. |
void |
validateInput(JobConf job)
Are the input directories valid? This method is used to test the input directories when a job is submitted so that the framework can fail early with a useful error message when the input directory does not exist. |
| Method Detail |
|---|
void validateInput(JobConf job)
throws IOException
job - the job to check
InvalidInputException - if the job does not have valid input
IOException
InputSplit[] getSplits(JobConf job,
int numSplits)
throws IOException
job - the job whose input files are to be splitnumSplits - the desired number of splits
IOException
RecordReader getRecordReader(InputSplit split,
JobConf job,
Reporter reporter)
throws IOException
RecordReader for a FileSplit.
split - the InputSplitjob - the job that this split belongs to
RecordReader
IOException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||