You can create co-ordinate files in Adobe Acrobat by selecting an area on a page, or by specifying particular text, and applying a command to it. ARTS PDF Split Pro™ uses these files to split a PDF based on the text on its pages. The full list of available commands is given below.
Split if text is present |
Splits if the provided string is present on the page.
|
Split if text contained in box |
Splits the file if the provided string is found in the rectangle. |
Split if text is a run |
Splits if the text is present within any single run. Text that spans multiple
runs is ignored. |
Split on text change trailing string |
Splits if the text inside the rectangle changes from one page to the next.
|
Split on text change after string |
Looks at the text inside the
rectangle, and if the text after the provided string changes from page to
page, then the file is split.
|
Split if this text repeats |
Looks at the text inside the rectangle and,
if the string appears in the rectangle on two consecutive pages, splits between
those pages.
|
Skip header |
With commands such as
splitiftextispresent, this discards all pages that come
before the first split.
|
Provide filename |
Uses the text inside the rectangle on the first page of a fragment as the filename for that
fragment.
|
Provide filename from first word |
Adds the first word of the text that
is inside the rectangle on the first page of a fragment to the filename for
that fragment. |
Provide filename from selected word |
Adds the chosen word of the text that
is inside the rectangle on the first page of a fragment to the filename for that
fragment.
|
Provide filename from range of characters |
Adds the chosen range of characters of the
text inside the rectangle on the first page of a fragment to the filename for that
fragment. |
Provide filename after string |
Takes the text inside the rectangle on the first
page of a fragment and adds the text that comes after the provided string to the
filename for that fragment. |
|
Provide base filename
|
Takes the literal string as the base filename for each fragment.
|
|
Provide filename if string is in box
|
Looks at the text inside the rectangle and, if the string appears, uses the text inside the rectangle on
the first page of a fragment as the filename for that fragment.
|
|
Provide filename after string in box
|
Uses the text that is inside the rectangle and comes after the string on the first page of a fragment as the filename for that fragment.
|
Add to filename |
Adds the literal string to the filename for each fragment.
|
Add to filename text in box
|
Adds text inside the box to the filename of each fragment.
|
|
Add to filename if string in box
|
Looks at the text inside the rectangle and, if the string appears, adds the text inside the box to the filename
of each fragment.
|
Add to filename after string in box
|
Looks at the text inside the rectangle and, if the string appears, adds the text after the string inside
the box to the filename of each fragment.
|
Delete characters from filename |
Any character that is in the literal
string will be deleted from the fragment filenames. Numeric digits in
the string may cause an error.
|
Filter filename |
The literal string is a sequence of
character pairs. If the first character of a pair is found in the fragment filename, it
is replaced with the second character of that pair. |
Get filenames from list in file |
The string is a complete path to a text file that contains a list of
filenames to use for the fragments. |
Include if text is present |
If the literal string is found to
occur in the text that is inside the
rectangle, the page is included in the fragment. |
Include all |
Type false for the literal string to
exclude all pages from every fragment unless another command
(e.g. splitiftextispresent) causes them to be included. |
|
Include if text contained in box
|
Looks at the text inside the rectangle and, if the string appears, includes the page in the fragment.
|
Include if text change after string
|
Looks at the text inside the rectangle and, if the text after the string changes from page to
page, includes the page in the fragment.
|
Exclude if text is present |
If the literal string occurs in the text inside the rectangle, the page is excluded
from the fragment.
|
Exclude if text contained in box
|
Looks at the text inside the rectangle and, if the string appears, excludes the page from the fragment.
|
|
Exclude if text not contained in box
|
Looks at the text inside the rectangle and, if the string does not appear, excludes the page from the
fragment.
|
Extract filename |
The literal string should be the
complete path of the file that will be created to contain all of the
extracted text.
|
Extract text |
The text inside the rectangle will
be extracted into the text file that was specified by the extract filename
command. |
Extract text selected words |
The chosen range of words from the
text inside the rectangle will be extracted into the text file that was
specified by the extract filename command.
|
Extract text skip words |
The text inside the rectangle that remains after
skipping the specified number of words will be extracted into the text
file that was specified by the extract filename command.
|
Extract text skip this word |
The text inside the
rectangle that comes after the specified literal text will be extracted into
the text file that was specified by the extract filename command.
|
Extract text skip characters |
The text inside the rectangle that
remains after skipping the specified number of characters will be extracted
into the text file that was specified by the extract filename command.
|
Extract only if this text in rect |
Turns on the text extraction if the
literal string occurs in the text inside the rectangle, and turns text
extraction off otherwise.
|
Extract only if this text not in rect |
Turns on the text extraction if the
literal string does not occur in the text inside the rectangle, and turns
text extraction off otherwise.
|
Extract textline after string |
If the literal text is found on a
visual line of text in the file, the rest of the text on that line only is
extracted. |
fillInfoDictEntry |
Takes the text that is inside the rectangle and stores it within the specified
key of the Info dictionary (Document Summary). |
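
To make the behaviour of a few of these commands concrete, here is a minimal Python sketch of the logic described for "Delete characters from filename", "Filter filename" and "Split on text change trailing string". It is an illustration only, not the product's implementation: the function names, the 0-based page numbering and the representation of the rectangle text as one string per page are all assumptions introduced for this example.

```python
from typing import List


def delete_characters(filename: str, literal: str) -> str:
    """Hypothetical helper: drop every character of `literal` from `filename`."""
    unwanted = set(literal)
    return "".join(ch for ch in filename if ch not in unwanted)


def filter_filename(filename: str, pairs: str) -> str:
    """Hypothetical helper: `pairs` is read two characters at a time;
    the first of each pair is replaced with the second."""
    if len(pairs) % 2 != 0:
        raise ValueError("pairs must contain an even number of characters")
    mapping = {pairs[i]: pairs[i + 1] for i in range(0, len(pairs), 2)}
    return "".join(mapping.get(ch, ch) for ch in filename)


def split_on_text_change(box_texts: List[str]) -> List[List[int]]:
    """Hypothetical helper: group 0-based page indices into fragments,
    starting a new fragment whenever the rectangle text differs from
    the text on the previous page."""
    fragments: List[List[int]] = []
    for page, text in enumerate(box_texts):
        if not fragments or text != box_texts[page - 1]:
            fragments.append([page])
        else:
            fragments[-1].append(page)
    return fragments


if __name__ == "__main__":
    print(delete_characters("Inv/2024:A", "/:"))             # Inv2024A
    print(filter_filename("Inv/2024:A", "/-:_"))             # Inv-2024_A
    print(split_on_text_change(["A", "A", "B", "B", "C"]))   # [[0, 1], [2, 3], [4]]
```

In the filter example, the literal string `/-:_` is read as the pairs (`/`, `-`) and (`:`, `_`), so slashes become hyphens and colons become underscores.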