A B C D E F G H I K L M N O P R S T U W X Z

A

add(String, String) - Method in class org.apache.tika.metadata.Metadata
Add a metadata name/value mapping.
add(String, String) - Method in class org.apache.tika.metadata.SpellCheckedMetadata
 
addAlias(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
 
addAlias(String) - Method in class org.apache.tika.mime.MimeType
Adds an alias name for this media type.
addMetadata(String) - Method in class org.apache.tika.parser.xml.MetadataHandler
 
addPattern(MimeType, String) - Method in class org.apache.tika.mime.MimeTypes
Adds a file name pattern for the given media type.
addPrefix(String, String) - Method in class org.apache.tika.sax.xpath.XPathParser
 
APPLICATION_NAME - Static variable in interface org.apache.tika.metadata.MSOffice
 
APPLICATION_XML - Static variable in class org.apache.tika.mime.MediaType
 
AttributeMatcher - Class in org.apache.tika.sax.xpath
Final evaluation state of a .../@* XPath expression.
AttributeMatcher() - Constructor for class org.apache.tika.sax.xpath.AttributeMatcher
 
AudioParser - Class in org.apache.tika.parser.audio
 
AudioParser() - Constructor for class org.apache.tika.parser.audio.AudioParser
 
AUTHOR - Static variable in interface org.apache.tika.metadata.MSOffice
 
AutoDetectParser - Class in org.apache.tika.parser
 
AutoDetectParser() - Constructor for class org.apache.tika.parser.AutoDetectParser
Creates an auto-detecting parser instance using the default Tika configuration.
AutoDetectParser(TikaConfig) - Constructor for class org.apache.tika.parser.AutoDetectParser
 
available() - Method in class org.apache.tika.parser.pkg.tar.TarInputStream
Get the available data that can be read from the current entry in the archive.

B

baseBlockSize - Static variable in interface org.apache.tika.parser.pkg.bzip2.BZip2Constants
 
BodyContentHandler - Class in org.apache.tika.sax
Content handler decorator that only passes everything inside the XHTML <body/> tag to the underlying handler.
BodyContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.BodyContentHandler
Creates a content handler that passes all XHTML body events to the given underlying content handler.
BodyContentHandler(Writer) - Constructor for class org.apache.tika.sax.BodyContentHandler
Creates a content handler that writes XHTML body character events to the given writer.
BodyContentHandler(OutputStream) - Constructor for class org.apache.tika.sax.BodyContentHandler
Creates a content handler that writes XHTML body character events to the given output stream using the default encoding.
BodyContentHandler() - Constructor for class org.apache.tika.sax.BodyContentHandler
Creates a content handler that writes XHTML body character events to an internal string buffer.
buffer - Variable in class org.apache.tika.parser.pkg.tar.TarInputStream
 
BZip2Constants - Interface in org.apache.tika.parser.pkg.bzip2
Base class for both the compress and decompress classes.
Bzip2Parser - Class in org.apache.tika.parser.pkg
Gzip parser.
Bzip2Parser() - Constructor for class org.apache.tika.parser.pkg.Bzip2Parser
 

C

CBZip2InputStream - Class in org.apache.tika.parser.pkg.bzip2
An input stream that decompresses from the BZip2 format (without the file header chars) to be read as any other stream.
CBZip2InputStream(InputStream) - Constructor for class org.apache.tika.parser.pkg.bzip2.CBZip2InputStream
Constructs a new CBZip2InputStream which decompresses bytes read from the specified stream.
Cell - Interface in org.apache.tika.parser.microsoft
Cell of content.
CellDecorator - Class in org.apache.tika.parser.microsoft
Cell decorator.
CellDecorator(Cell) - Constructor for class org.apache.tika.parser.microsoft.CellDecorator
 
CHARACTER_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
 
characters(char[], int, int) - Method in class org.apache.tika.parser.xml.MetadataHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
characters(char[], int, int) - Method in class org.apache.tika.sax.TeeContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.TextContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.WriteOutContentHandler
Writes the given characters to the given character stream.
characters(String) - Method in class org.apache.tika.sax.XHTMLContentHandler
 
characters(char[], int, int) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
 
ChildMatcher - Class in org.apache.tika.sax.xpath
Intermediate evaluation state of a .../*... XPath expression.
ChildMatcher(Matcher) - Constructor for class org.apache.tika.sax.xpath.ChildMatcher
 
CHKSUMLEN - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The length of the checksum field in a header buffer.
ClassParser - Class in org.apache.tika.parser.asm
Parser for Java .class files.
ClassParser() - Constructor for class org.apache.tika.parser.asm.ClassParser
 
close() - Method in class org.apache.tika.parser.ParsingReader
Closes the read end of the pipe.
close() - Method in class org.apache.tika.parser.pkg.bzip2.CBZip2InputStream
 
close() - Method in class org.apache.tika.parser.pkg.tar.TarBuffer
Close the TarBuffer.
close() - Method in class org.apache.tika.parser.pkg.tar.TarInputStream
Closes this stream.
close() - Method in class org.apache.tika.utils.RereadableInputStream
Closes the input stream and removes the temporary file if one was created.
COMMENTS - Static variable in interface org.apache.tika.metadata.MSOffice
 
compareTo(MimeType) - Method in class org.apache.tika.mime.MimeType
 
CompositeMatcher - Class in org.apache.tika.sax.xpath
Composite XPath evaluation state.
CompositeMatcher(Matcher, Matcher) - Constructor for class org.apache.tika.sax.xpath.CompositeMatcher
 
CompositeParser - Class in org.apache.tika.parser
Composite parser that delegates parsing tasks to a component parser based on the declared content type of the incoming document.
CompositeParser() - Constructor for class org.apache.tika.parser.CompositeParser
 
computeCheckSum(byte[]) - Static method in class org.apache.tika.parser.pkg.tar.TarUtils
Compute the checksum of a tar entry header.
CONTENT_DISPOSITION - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_ENCODING - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_LANGUAGE - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_LENGTH - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_LOCATION - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_MD5 - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
CONTENT_TYPE - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
ContentHandlerDecorator - Class in org.apache.tika.sax
Decorator base class for the ContentHandler interface.
ContentHandlerDecorator(ContentHandler) - Constructor for class org.apache.tika.sax.ContentHandlerDecorator
Creates a decorator for the given SAX event handler.
CONTRIBUTOR - Static variable in interface org.apache.tika.metadata.DublinCore
An entity responsible for making contributions to the content of the resource.
copyEntryContents(OutputStream) - Method in class org.apache.tika.parser.pkg.tar.TarInputStream
Copies the contents of the current tar archive entry directly into an output stream.
COVERAGE - Static variable in interface org.apache.tika.metadata.DublinCore
The extent or scope of the content of the resource.
create() - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates an empty instance; same as calling new MimeTypes().
create(Document) - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates and returns a MimeTypes instance from the specified document.
create(InputStream) - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates and returns a MimeTypes instance from the specified input stream.
create(URL) - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates and returns a MimeTypes instance from the resource at the location specified by the URL.
create(String) - Static method in class org.apache.tika.mime.MimeTypesFactory
Creates and returns a MimeTypes instance from the specified file path, as interpreted by the class loader in getResource().
CreativeCommons - Interface in org.apache.tika.metadata
A collection of Creative Commons properties names.
CREATOR - Static variable in interface org.apache.tika.metadata.DublinCore
An entity primarily responsible for making the content of the resource.
currEntry - Variable in class org.apache.tika.parser.pkg.tar.TarInputStream
 

D

DATE - Static variable in interface org.apache.tika.metadata.DublinCore
A date associated with an event in the life cycle of the resource.
DcXMLParser - Class in org.apache.tika.parser.xml
Dublin Core metadata parser
DcXMLParser() - Constructor for class org.apache.tika.parser.xml.DcXMLParser
 
debug - Variable in class org.apache.tika.parser.pkg.tar.TarInputStream
 
decode(String) - Static method in class org.apache.tika.mime.HexCoDec
Decode a hex string
decode(char[]) - Static method in class org.apache.tika.mime.HexCoDec
Decode an array of hex chars
decode(char[], int, int) - Static method in class org.apache.tika.mime.HexCoDec
Decode an array of hex chars.
DEFAULT - Static variable in class org.apache.tika.mime.MimeTypes
The default application/octet-stream MimeType
DEFAULT_BLKSIZE - Static variable in class org.apache.tika.parser.pkg.tar.TarBuffer
Default block size
DEFAULT_CONFIG_LOCATION - Static variable in class org.apache.tika.config.TikaConfig
 
DEFAULT_DIR_MODE - Static variable in class org.apache.tika.parser.pkg.tar.TarEntry
Default permissions bits for directories
DEFAULT_FILE_MODE - Static variable in class org.apache.tika.parser.pkg.tar.TarEntry
Default permissions bits for files
DEFAULT_RCDSIZE - Static variable in class org.apache.tika.parser.pkg.tar.TarBuffer
Default record size
descend(String, String) - Method in class org.apache.tika.sax.xpath.ChildMatcher
 
descend(String, String) - Method in class org.apache.tika.sax.xpath.CompositeMatcher
 
descend(String, String) - Method in class org.apache.tika.sax.xpath.Matcher
Returns the XPath evaluation state that results from descending to a child element with the given name.
descend(String, String) - Method in class org.apache.tika.sax.xpath.NamedElementMatcher
 
descend(String, String) - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
 
DESCRIPTION - Static variable in interface org.apache.tika.metadata.DublinCore
An account of the content of the resource.
DEVLEN - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The length of the devices field in a header buffer.
DublinCore - Interface in org.apache.tika.metadata
A collection of Dublin Core metadata names.

E

element(String, String) - Method in class org.apache.tika.sax.XHTMLContentHandler
 
ElementMatcher - Class in org.apache.tika.sax.xpath
Final evaluation state of an XPath expression that targets an element.
ElementMatcher() - Constructor for class org.apache.tika.sax.xpath.ElementMatcher
 
EmptyParser - Class in org.apache.tika.parser
Dummy parser that always produces an empty XHTML document without even attempting to parse the given document stream.
EmptyParser() - Constructor for class org.apache.tika.parser.EmptyParser
 
encode(byte[]) - Static method in class org.apache.tika.mime.HexCoDec
Hex encode an array of bytes
encode(byte[], int, int) - Static method in class org.apache.tika.mime.HexCoDec
Hex encode an array of bytes
endDocument() - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
endDocument() - Method in class org.apache.tika.sax.TeeContentHandler
 
endDocument() - Method in class org.apache.tika.sax.XHTMLContentHandler
Ends the XHTML document by writing the following footer and clearing the namespace mappings:
endDocument() - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
Ignored.
endElement(String, String, String) - Method in class org.apache.tika.parser.xml.MetadataHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
endElement(String, String, String) - Method in class org.apache.tika.sax.TeeContentHandler
 
endElement(String) - Method in class org.apache.tika.sax.XHTMLContentHandler
 
endElement(String, String, String) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
 
endPrefixMapping(String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
endPrefixMapping(String) - Method in class org.apache.tika.sax.TeeContentHandler
 
entryOffset - Variable in class org.apache.tika.parser.pkg.tar.TarInputStream
 
entrySize - Variable in class org.apache.tika.parser.pkg.tar.TarInputStream
 
equals(Object) - Method in class org.apache.tika.metadata.Metadata
 
equals(Object) - Method in class org.apache.tika.mime.MediaType
 
equals(TarEntry) - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Determine if the two entries are equal.
equals(Object) - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Determine if the two entries are equal.
ErrorParser - Class in org.apache.tika.parser
Dummy parser that always throws a TikaException without even attempting to parse the given document stream.
ErrorParser() - Constructor for class org.apache.tika.parser.ErrorParser
 
ExcelExtractor - Class in org.apache.tika.parser.microsoft
Excel parser implementation which uses POI's Event API to handle the contents of a Workbook.
ExcelExtractor() - Constructor for class org.apache.tika.parser.microsoft.ExcelExtractor
 
ExternalParser - Class in org.apache.tika.parser
Parser that uses an external program (like catdoc or pdf2txt) to extract text content from a given document.
ExternalParser() - Constructor for class org.apache.tika.parser.ExternalParser
 
extractLinks(String) - Static method in class org.apache.tika.utils.RegexUtils
Extract urls from plain text.

F

FAIL - Static variable in class org.apache.tika.sax.xpath.Matcher
State of a failed XPath evaluation, where nothing is matched.
FORMAT - Static variable in interface org.apache.tika.metadata.DublinCore
Typically, Format may include the media-type or dimensions of the resource.
forName(String) - Method in class org.apache.tika.mime.MimeTypes
Returns the registered media type with the given name (or alias).
fromHexString(String) - Static method in class org.apache.tika.utils.StringUtil
Convert a String containing consecutive (no inside whitespace) hexadecimal digits into a corresponding byte array.

G

G_SIZE - Static variable in interface org.apache.tika.parser.pkg.bzip2.BZip2Constants
 
get(String) - Method in class org.apache.tika.metadata.Metadata
Get the value associated to a metadata name.
get(String) - Method in class org.apache.tika.metadata.SpellCheckedMetadata
 
getAliases() - Method in class org.apache.tika.mime.MimeType
Returns the aliases of this media type.
getBaseType() - Method in class org.apache.tika.mime.MediaType
 
getBlockSize() - Method in class org.apache.tika.parser.pkg.tar.TarBuffer
Get the TAR Buffer's block size.
getCheckSumOctalBytes(long, byte[], int, int) - Static method in class org.apache.tika.parser.pkg.tar.TarUtils
Parse the checksum octal integer from a header buffer.
getContentParser() - Method in class org.apache.tika.parser.opendocument.OpenOfficeParser
 
getCurrentBlockNum() - Method in class org.apache.tika.parser.pkg.tar.TarBuffer
Get the current block number, zero based.
getCurrentRecordNum() - Method in class org.apache.tika.parser.pkg.tar.TarBuffer
Get the current record number, within the current block, zero based.
getDefaultConfig() - Static method in class org.apache.tika.config.TikaConfig
Provides a default configuration (TikaConfig).
getDefaultHandler(ContentHandler, Metadata) - Method in class org.apache.tika.parser.opendocument.OpenOfficeContentParser
 
getDefaultHandler(ContentHandler, Metadata) - Method in class org.apache.tika.parser.opendocument.OpenOfficeMetaParser
 
getDefaultHandler(ContentHandler, Metadata) - Method in class org.apache.tika.parser.xml.DcXMLParser
 
getDefaultHandler(ContentHandler, Metadata) - Method in class org.apache.tika.parser.xml.XMLParser
 
getDescription() - Method in class org.apache.tika.mime.MimeType
Returns the description of this media type.
getDirectoryEntries() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
If this entry represents a file, and the file is a directory, return an array of TarEntries for this entry's children.
getFallback() - Method in class org.apache.tika.parser.CompositeParser
Returns the fallback parser.
getFile() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Get this entry's file.
getGroupId() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Get this entry's group id.
getGroupName() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Get this entry's group name.
getLinkName() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Get this entry's link name.
getLongOctalBytes(long, byte[], int, int) - Static method in class org.apache.tika.parser.pkg.tar.TarUtils
Parse an octal long integer from a header buffer.
getMetaParser() - Method in class org.apache.tika.parser.opendocument.OpenOfficeParser
 
getMimeRepository() - Method in class org.apache.tika.config.TikaConfig
 
getMimeType(File) - Method in class org.apache.tika.mime.MimeTypes
Find the Mime Content Type of a file.
getMimeType(URL) - Method in class org.apache.tika.mime.MimeTypes
Find the Mime Content Type of a document from its URL.
getMimeType(String) - Method in class org.apache.tika.mime.MimeTypes
Find the Mime Content Type of a document from its name.
getMimeType(byte[]) - Method in class org.apache.tika.mime.MimeTypes
Returns the MIME type that best matches the given first few bytes of a document stream.
getMimeType(InputStream) - Method in class org.apache.tika.mime.MimeTypes
Returns the MIME type that best matches the first few bytes of the given document stream.
getMimeType(String, byte[]) - Method in class org.apache.tika.mime.MimeTypes
Find the Mime Content Type of a document from its name and its content.
getMimeType(String, InputStream) - Method in class org.apache.tika.mime.MimeTypes
Returns the MIME type that best matches the given document name and the first few bytes of the given document stream.
getMimeTypes() - Method in class org.apache.tika.parser.AutoDetectParser
 
getMinLength() - Method in class org.apache.tika.mime.MimeTypes
Return the minimum length of data to provide to analyzing methods based on the document's content in order to check all the known MimeTypes.
getMode() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Get this entry's mode.
getModTime() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Set this entry's modification time.
getName() - Method in class org.apache.tika.mime.MimeType
Returns the name of this media type.
getName() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Get this entry's name.
getNameBytes(StringBuffer, byte[], int, int) - Static method in class org.apache.tika.parser.pkg.tar.TarUtils
Determine the number of bytes in an entry name.
getNextEntry() - Method in class org.apache.tika.parser.pkg.tar.TarInputStream
Get the next entry in this tar archive.
getNormalizedName(String) - Static method in class org.apache.tika.metadata.SpellCheckedMetadata
Get the normalized name of metadata attribute name.
getOctalBytes(long, byte[], int, int) - Static method in class org.apache.tika.parser.pkg.tar.TarUtils
Parse an octal integer from a header buffer.
getParameters() - Method in class org.apache.tika.mime.MediaType
 
getParser(String) - Method in class org.apache.tika.config.TikaConfig
Returns the parser instance configured for the given MIME type.
getParser(Metadata) - Method in class org.apache.tika.parser.CompositeParser
Returns the parser that best matches the given metadata.
getParser() - Method in class org.apache.tika.parser.pkg.PackageParser
Returns the parser instance used to parse package entries.
getParser(String, TikaConfig) - Static method in class org.apache.tika.utils.ParseUtils
Returns a parser that can handle the specified MIME type, and is set to receive input from a stream opened from the specified URL.
getParser(URL, TikaConfig) - Static method in class org.apache.tika.utils.ParseUtils
Returns a parser that can handle the specified MIME type, and is set to receive input from a stream opened from the specified URL.
getParser(File, TikaConfig) - Static method in class org.apache.tika.utils.ParseUtils
Returns a parser that can handle the specified MIME type, and is set to receive input from a stream opened from the specified URL.
getParsers() - Method in class org.apache.tika.config.TikaConfig
 
getParsers() - Method in class org.apache.tika.parser.CompositeParser
Returns the component parsers.
getParsersFromZip(File, TikaConfig) - Static method in class org.apache.tika.utils.ParseUtils
Returns a list of parsers from zip File
getParsersFromZip(URL, TikaConfig) - Static method in class org.apache.tika.utils.ParseUtils
Returns a list of parsers from URL
getRecordSize() - Method in class org.apache.tika.parser.pkg.tar.TarBuffer
Get the TAR Buffer's record size.
getRecordSize() - Method in class org.apache.tika.parser.pkg.tar.TarInputStream
Get the record size being used by this stream's TarBuffer.
getSize() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Get this entry's file size.
getSize() - Method in class org.apache.tika.utils.RereadableInputStream
Returns the number of bytes read from the original stream.
getStringContent(InputStream, TikaConfig, String) - Static method in class org.apache.tika.utils.ParseUtils
Gets the string content of a document read from an input stream.
getStringContent(URL, TikaConfig) - Static method in class org.apache.tika.utils.ParseUtils
Gets the string content of a document read from an input stream.
getStringContent(URL, TikaConfig, String) - Static method in class org.apache.tika.utils.ParseUtils
Gets the string content of a document read from an input stream.
getStringContent(File, TikaConfig, String) - Static method in class org.apache.tika.utils.ParseUtils
Gets the string content of a document read from an input stream.
getStringContent(File, TikaConfig) - Static method in class org.apache.tika.utils.ParseUtils
Gets the string content of a document read from an input stream.
getSubtype() - Method in class org.apache.tika.mime.MediaType
 
getSubTypes() - Method in class org.apache.tika.mime.MimeType
 
getSuperType() - Method in class org.apache.tika.mime.MimeType
Returns the parent of this media type.
getType() - Method in class org.apache.tika.mime.MediaType
 
getType(String, String, byte[]) - Method in class org.apache.tika.mime.MimeTypes
 
getType(URL) - Method in class org.apache.tika.mime.MimeTypes
Determines the MIME type of the resource pointed to by the specified URL.
getUserId() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Get this entry's user id.
getUserName() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Get this entry's user name.
getUTF8Reader(InputStream, Metadata) - Static method in class org.apache.tika.utils.Utils
Try to detect encoding from inputstream and return a UTF-8 Reader.
getValues(String) - Method in class org.apache.tika.metadata.Metadata
Get the values associated to a metadata name.
getValues(String) - Method in class org.apache.tika.metadata.SpellCheckedMetadata
 
GIDLEN - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The length of the group id field in a header buffer.
GNAMELEN - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The length of the group name field in a header buffer.
GNU_LONGLINK - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The namr of the GNU tar entry which contains a long name.
GNU_TMAGIC - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The magic tag representing a GNU tar archive.
GzipParser - Class in org.apache.tika.parser.pkg
Gzip parser.
GzipParser() - Constructor for class org.apache.tika.parser.pkg.GzipParser
 

H

hashCode() - Method in class org.apache.tika.mime.MediaType
 
hashCode() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Hashcodes are based on entry names.
hasHitEOF - Variable in class org.apache.tika.parser.pkg.tar.TarInputStream
 
hasMagic() - Method in class org.apache.tika.mime.MimeType
 
HexCoDec - Class in org.apache.tika.mime
A set of Hex encoding and decoding utility methods.
HexCoDec() - Constructor for class org.apache.tika.mime.HexCoDec
 
HtmlParser - Class in org.apache.tika.parser.html
HTML parser.
HtmlParser() - Constructor for class org.apache.tika.parser.html.HtmlParser
 
HttpHeaders - Interface in org.apache.tika.metadata
A collection of HTTP header names.

I

IDENTIFIER - Static variable in interface org.apache.tika.metadata.DublinCore
Recommended best practice is to identify the resource by means of a string or number conforming to a formal identification system.
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.ContentHandlerDecorator
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.TeeContentHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.TextContentHandler
 
ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
 
ImageParser - Class in org.apache.tika.parser.image
 
ImageParser() - Constructor for class org.apache.tika.parser.image.ImageParser
 
importFile(File) - Method in class org.apache.tika.gui.TikaGUI
 
INSTANCE - Static variable in class org.apache.tika.sax.xpath.AttributeMatcher
 
INSTANCE - Static variable in class org.apache.tika.sax.xpath.ElementMatcher
 
INSTANCE - Static variable in class org.apache.tika.sax.xpath.NodeMatcher
 
INSTANCE - Static variable in class org.apache.tika.sax.xpath.TextMatcher
 
isDescendantOf(MimeType) - Method in class org.apache.tika.mime.MimeType
 
isDescendent(TarEntry) - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Determine if the given entry is a descendant of this entry.
isDirectory() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Return whether or not this entry represents a directory.
isEmpty(String) - Static method in class org.apache.tika.utils.StringUtil
Checks if a string is empty (ie is null or empty).
isEOFRecord(byte[]) - Method in class org.apache.tika.parser.pkg.tar.TarBuffer
Determine if an archive record indicate End of Archive.
isGNULongNameEntry() - Method in class org.apache.tika.parser.pkg.tar.TarEntry
Indicate if this entry is a GNU long name block
isListenForAllRecords() - Method in class org.apache.tika.parser.microsoft.ExcelExtractor
Returns true if this parser is configured to listen for all records instead of just the specified few.
isMultiValued(String) - Method in class org.apache.tika.metadata.Metadata
Returns true if named value is multivalued.
isSpecializationOf(MediaType) - Method in class org.apache.tika.mime.MediaType
 
isValid(String) - Static method in class org.apache.tika.mime.MimeType
Checks that the given string is a valid Internet media type name based on rules from RFC 2054 section 5.3.

K

KEYWORDS - Static variable in interface org.apache.tika.metadata.MSOffice
 

L

LANGUAGE - Static variable in interface org.apache.tika.metadata.DublinCore
A language of the intellectual content of the resource.
LAST_AUTHOR - Static variable in interface org.apache.tika.metadata.MSOffice
 
LAST_MODIFIED - Static variable in interface org.apache.tika.metadata.HttpHeaders
 
LAST_PRINTED - Static variable in interface org.apache.tika.metadata.MSOffice
 
LAST_SAVED - Static variable in interface org.apache.tika.metadata.MSOffice
 
leftPad(String, int) - Static method in class org.apache.tika.utils.StringUtil
Returns a copy of s padded with leading spaces so that it's length is length.
LF_BLK - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
Block device file type.
LF_CHR - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
Character device file type.
LF_CONTIG - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
Contiguous file type.
LF_DIR - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
Directory file type.
LF_FIFO - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
FIFO (pipe) file type.
LF_GNUTYPE_LONGNAME - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
Identifies the *next* file on the tape as having a long name.
LF_LINK - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
Link file type.
LF_NORMAL - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
Normal file type.
LF_OLDNORM - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
LF_ constants represent the "link flag" of an entry, or more commonly, the "entry type".
LF_SYMLINK - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
Symbolic link file type.
LICENSE_LOCATION - Static variable in interface org.apache.tika.metadata.CreativeCommons
 
LICENSE_URL - Static variable in interface org.apache.tika.metadata.CreativeCommons
 
LinkedCell - Class in org.apache.tika.parser.microsoft
Linked cell.
LinkedCell(Cell, String) - Constructor for class org.apache.tika.parser.microsoft.LinkedCell
 
LOCATION - Static variable in interface org.apache.tika.metadata.HttpHeaders
 

M

MAGICLEN - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The length of the magic field in a header buffer.
main(String[]) - Static method in class org.apache.tika.cli.TikaCLI
 
main(String[]) - Static method in class org.apache.tika.gui.TikaGUI
Main method.
main(String[]) - Static method in class org.apache.tika.utils.StringUtil
 
mark(int) - Method in class org.apache.tika.parser.pkg.tar.TarInputStream
Since we do not support marking just yet, we do nothing.
markSupported() - Method in class org.apache.tika.parser.pkg.tar.TarInputStream
Since we do not support marking just yet, we return false.
Matcher - Class in org.apache.tika.sax.xpath
XPath element matcher.
Matcher() - Constructor for class org.apache.tika.sax.xpath.Matcher
 
matches(byte[]) - Method in class org.apache.tika.mime.MimeType
 
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.AttributeMatcher
 
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.CompositeMatcher
 
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.Matcher
Returns true if the XPath expression matches the named attribute of the element associated with this evaluation state.
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.NamedAttributeMatcher
 
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.NodeMatcher
 
matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
 
matchesElement() - Method in class org.apache.tika.sax.xpath.CompositeMatcher
 
matchesElement() - Method in class org.apache.tika.sax.xpath.ElementMatcher
 
matchesElement() - Method in class org.apache.tika.sax.xpath.Matcher
Returns true if the XPath expression matches the element associated with this evaluation state.
matchesElement() - Method in class org.apache.tika.sax.xpath.NodeMatcher
 
matchesElement() - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
 
matchesMagic(byte[]) - Method in class org.apache.tika.mime.MimeType
 
matchesText() - Method in class org.apache.tika.sax.xpath.CompositeMatcher
 
matchesText() - Method in class org.apache.tika.sax.xpath.Matcher
Returns true if the XPath expression matches all text nodes whose parent is the element associated with this evaluation state.
matchesText() - Method in class org.apache.tika.sax.xpath.NodeMatcher
 
matchesText() - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
 
matchesText() - Method in class org.apache.tika.sax.xpath.TextMatcher
 
MatchingContentHandler - Class in org.apache.tika.sax.xpath
Content handler decorator that only passes the elements, attributes, and text nodes that match the given XPath expression.
MatchingContentHandler(ContentHandler, Matcher) - Constructor for class org.apache.tika.sax.xpath.MatchingContentHandler
 
MAX_ALPHA_SIZE - Static variable in interface org.apache.tika.parser.pkg.bzip2.BZip2Constants
 
MAX_CODE_LEN - Static variable in interface org.apache.tika.parser.pkg.bzip2.BZip2Constants
 
MAX_NAMELEN - Static variable in class org.apache.tika.parser.pkg.tar.TarEntry
Maximum length of a user's name in the tar file
MAX_SELECTORS - Static variable in interface org.apache.tika.parser.pkg.bzip2.BZip2Constants
 
MAXSIZE - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The maximum size of a file in a tar archive (That's 11 sevens, octal).
MediaType - Class in org.apache.tika.mime
Internet media type.
MediaType(String, String, Map<String, String>) - Constructor for class org.apache.tika.mime.MediaType
 
MediaType(String, String) - Constructor for class org.apache.tika.mime.MediaType
 
MediaType(MediaType, Map<String, String>) - Constructor for class org.apache.tika.mime.MediaType
 
MediaTypeRegistry - Class in org.apache.tika.mime
Registry of Internet media types.
MediaTypeRegistry() - Constructor for class org.apache.tika.mime.MediaTypeRegistry
 
Metadata - Class in org.apache.tika.metadata
A multi-valued metadata container.
Metadata() - Constructor for class org.apache.tika.metadata.Metadata
Constructs a new, empty metadata.
MetadataHandler - Class in org.apache.tika.parser.xml
 
MetadataHandler(Metadata, String) - Constructor for class org.apache.tika.parser.xml.MetadataHandler
 
MidiParser - Class in org.apache.tika.parser.audio
 
MidiParser() - Constructor for class org.apache.tika.parser.audio.MidiParser
 
MILLIS_PER_SECOND - Static variable in class org.apache.tika.parser.pkg.tar.TarEntry
Convert millis to seconds
MIME_TYPE_MAGIC - Static variable in interface org.apache.tika.metadata.TikaMimeKeys
 
MimeType - Class in org.apache.tika.mime
Internet media type.
MimeTypeException - Exception in org.apache.tika.mime
A class to encapsulate MimeType related exceptions.
MimeTypeException() - Constructor for exception org.apache.tika.mime.MimeTypeException
Constructs a MimeTypeException with no specified detail message.
MimeTypeException(String) - Constructor for exception org.apache.tika.mime.MimeTypeException
Constructs a MimeTypeException with the specified detail message.
MimeTypeException(Throwable) - Constructor for exception org.apache.tika.mime.MimeTypeException
Constructs a MimeTypeException with the specified cause.
MimeTypes - Class in org.apache.tika.mime
This class is a MimeType repository.
MimeTypes() - Constructor for class org.apache.tika.mime.MimeTypes
 
MimeTypesFactory - Class in org.apache.tika.mime
Creates instances of MimeTypes.
MimeTypesFactory() - Constructor for class org.apache.tika.mime.MimeTypesFactory
 
MODELEN - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The length of the mode field in a header buffer.
MODIFIED - Static variable in interface org.apache.tika.metadata.DublinCore
Date on which the resource was changed.
MODTIMELEN - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The length of the modification time field in a header buffer.
Mp3Parser - Class in org.apache.tika.parser.mp3
The Mp3Parser is used to parse ID3 Version 1 Tag information from an MP3 file, if available.
Mp3Parser() - Constructor for class org.apache.tika.parser.mp3.Mp3Parser
 
MSOffice - Interface in org.apache.tika.metadata
A collection of "Office" documents properties names.

N

N_GROUPS - Static variable in interface org.apache.tika.parser.pkg.bzip2.BZip2Constants
 
N_ITERS - Static variable in interface org.apache.tika.parser.pkg.bzip2.BZip2Constants
 
NamedAttributeMatcher - Class in org.apache.tika.sax.xpath
Final evaluation state of a .../@name XPath expression.
NamedAttributeMatcher(String, String) - Constructor for class org.apache.tika.sax.xpath.NamedAttributeMatcher
 
NamedElementMatcher - Class in org.apache.tika.sax.xpath
Intermediate evaluation state of a .../name... XPath expression.
NamedElementMatcher(String, String, Matcher) - Constructor for class org.apache.tika.sax.xpath.NamedElementMatcher
 
NAMELEN - Static variable in interface org.apache.tika.parser.pkg.tar.TarConstants
The length of the name field in a header buffer.
names() - Method in class org.apache.tika.metadata.Metadata
Returns an array of the names contained in the metadata.
NodeMatcher - Class in org.apache.tika.sax.xpath
Final evaluation state of a .../node() XPath expression.
NodeMatcher() - Constructor for class org.apache.tika.sax.xpath.NodeMatcher
 
NUM_OVERSHOOT_BYTES - Static variable in interface org.apache.tika.parser.pkg.bzip2.BZip2Constants
 
NumberCell - Class in org.apache.tika.parser.microsoft
Number cell.
NumberCell(double, NumberFormat) - Constructor for class org.apache.tika.parser.microsoft.NumberCell
 
NumberCell(double) - Constructor for class org.apache.tika.parser.microsoft.NumberCell
 

O

OCTET_STREAM - Static variable in class org.apache.tika.mime.MediaType
 
OfficeParser - Class in org.apache.tika.parser.microsoft
Defines a Microsoft document content extractor.
OfficeParser() - Constructor for class org.apache.tika.parser.microsoft.OfficeParser
 
oneBuf - Variable in class org.apache.tika.parser.pkg.tar.TarInputStream
This contents of this array is not used at all in this class, it is only here to avoid repreated object creation during calls to the no-arg read method.
OpenOfficeContentParser - Class in org.apache.tika.parser.opendocument
Parser for OpenDocument content.xml files.
OpenOfficeContentParser() - Constructor for class org.apache.tika.parser.opendocument.OpenOfficeContentParser
 
OpenOfficeMetaParser - Class in org.apache.tika.parser.opendocument
Parser for OpenDocument meta.xml files.
OpenOfficeMetaParser() - Constructor for class org.apache.tika.parser.opendocument.OpenOfficeMetaParser
 
OpenOfficeParser - Class in org.apache.tika.parser.opendocument
OpenOffice parser
OpenOfficeParser() - Constructor for class org.apache.tika.parser.opendocument.OpenOfficeParser
 
org.apache.tika.cli - package org.apache.tika.cli
 
org.apache.tika.config - package org.apache.tika.config
 
org.apache.tika.exception - package org.apache.tika.exception
 
org.apache.tika.gui - package org.apache.tika.gui
 
org.apache.tika.metadata - package org.apache.tika.metadata
A Multi-valued Metadata container, and set of constant fields for Tika Metadata.
org.apache.tika.mime - package org.apache.tika.mime
 
org.apache.tika.parser - package org.apache.tika.parser
 
org.apache.tika.parser.asm - package org.apache.tika.parser.asm
 
org.apache.tika.parser.audio - package org.apache.tika.parser.audio
 
org.apache.tika.parser.html - package org.apache.tika.parser.html
 
org.apache.tika.parser.image - package org.apache.tika.parser.image
 
org.apache.tika.parser.microsoft - package org.apache.tika.parser.microsoft
 
org.apache.tika.parser.mp3 - package org.apache.tika.parser.mp3
 
org.apache.tika.parser.opendocument - package org.apache.tika.parser.opendocument
 
org.apache.tika.parser.pdf - package org.apache.tika.parser.pdf
 
org.apache.tika.parser.pkg - package org.apache.tika.parser.pkg
 
org.apache.tika.parser.pkg.bzip2 - package org.apache.tika.parser.pkg.bzip2
 
org.apache.tika.parser.pkg.tar - package org.apache.tika.parser.pkg.tar
 
org.apache.tika.parser.rtf - package org.apache.tika.parser.rtf
 
org.apache.tika.parser.txt - package org.apache.tika.parser.txt
 
org.apache.tika.parser.xml - package org.apache.tika.parser.xml
 
org.apache.tika.sax - package org.apache.tika.sax
 
org.apache.tika.sax.xpath - package org.apache.tika.sax.xpath
 
org.apache.tika.utils - package org.apache.tika.utils
 

P

PackageParser - Class in org.apache.tika.parser.pkg
Abstract base class for parsers that deal with package formats.
PackageParser() - Constructor for class org.apache.tika.parser.pkg.PackageParser
 
PAGE_COUNT - Static variable in interface org.apache.tika.metadata.MSOffice
 
parse(String) - Static method in class org.apache.tika.mime.MediaType
Parses the given string to a media type.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.asm.ClassParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.audio.AudioParser
 
parse(InputStream, Metadata) - Method in class org.apache.tika.parser.audio.AudioParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.audio.MidiParser
 
parse(InputStream, Metadata) - Method in class org.apache.tika.parser.audio.MidiParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.AutoDetectParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.CompositeParser
Delegates the call to the matching component parser.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.EmptyParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.ErrorParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.ExternalParser
Executes the configured external command and passes the given document stream as a simple XHTML document to the given SAX content handler.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.html.HtmlParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.image.ImageParser
 
parse(POIFSFileSystem, XHTMLContentHandler) - Method in class org.apache.tika.parser.microsoft.ExcelExtractor
Extracts text from an Excel Workbook writing the extracted content to the specified Appendable.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.microsoft.OfficeParser
Extracts properties and text from an MS Document input stream
parse(DocumentEntry, Metadata) - Method in class org.apache.tika.parser.microsoft.OfficeParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.mp3.Mp3Parser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.opendocument.OpenOfficeParser
 
parse(InputStream, ContentHandler, Metadata) - Method in interface org.apache.tika.parser.Parser
Parses a document stream into a sequence of XHTML SAX events.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.ParserDecorator
Delegates the method call to the decorated parser.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.ParserPostProcessor
Forwards the call to the delegated parser and post-processes the results as described above.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.pdf.PDFParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.pkg.Bzip2Parser
Parses the given stream as a gzip file.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.pkg.GzipParser
Parses the given stream as a gzip file.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.pkg.TarParser
Parses the given stream as a tar file.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.pkg.ZipParser
Parses the given stream as a Zip file.
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.rtf.RTFParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.txt.TXTParser
 
parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.xml.XMLParser