Package opennlp.tools.formats
Class TwentyNewsgroupSampleStream
java.lang.Object
opennlp.tools.formats.TwentyNewsgroupSampleStream
- All Implemented Interfaces:
AutoCloseable,ObjectStream<DocumentSample>
An
ObjectStream implementation for the Twenty Newsgroups text corpus.
The document collection was created and donated by: Tom Mitchell, School of Computer Science, Carnegie Mellon University.
Details and the data can be found via this DOI: 10.24432/C5C323.
-
Method Summary
Modifier and TypeMethodDescriptionread()Returns the nextObjectStreamobject.voidreset()Repositions the stream at the beginning and the previously seen object sequence will be repeated exactly.Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface opennlp.tools.util.ObjectStream
close
-
Method Details
-
read
Description copied from interface:ObjectStreamReturns the nextObjectStreamobject. Calling this method repeatedly until it returnsnullwill return each object from the underlying source exactly once.- Specified by:
readin interfaceObjectStream<DocumentSample>- Returns:
- The next object or
nullto signal that the stream is exhausted. - Throws:
IOException- Thrown if there is an error during reading.
-
reset
Description copied from interface:ObjectStreamRepositions the stream at the beginning and the previously seen object sequence will be repeated exactly. This method can be used to re-read the stream if multiple passes over the objects are required.The implementation of this method is optional.
- Specified by:
resetin interfaceObjectStream<DocumentSample>- Throws:
IOException- Thrown if there is an error during resetting the stream.UnsupportedOperationException- Thrown if thereset()is not supported. By default, this is the case.
-