pypyr.steps.fileformat
find & replace tokens in text files
Parses input text file, substitutes {tokens} in the text of the file from the pypyr context and writes the result to an output file.
- name: pypyr.steps.fileformat
comment: read a file from disk,
do some substitutions,
write back to disk.
in:
fileFormat:
in: ./in/arb.txt # required
out: ./out/arb.txt # optional
So if you had a text file in/arb.txt
like this:
{k1} sit thee down and write
In a book that all may {k2}
And your pypyr context were:
k1: pypyr
k2: read
You would end up with an output file out/arb.txt
like this:
pypyr sit thee down and write
In a book that all may read
The file in
and out
paths support substitutions.
See a worked example of fileformat.
escape characters
If your file contains {curly braces} that pypyr should NOT try to substitute, you can escape these by {{doubling}}. A doubled curly brace will output as a single brace in the output file.
If doubling the braces gets annoying, you could use filereplace instead and pick your own replacement token indicators that work better for your file.
For json, yaml and toml, where {curly braces} have structural meaning, you can use fileformatjson, fileformatyaml and fileformattoml.
file format settings
in
and out
behave in the same way for all the format style steps. Therefore
this section’s description of in
and out
settings applies not only to
fileformat
, but also
fileformatjson,
fileformatyaml,
fileformattoml and
filereplace.
set input & output files
fileformat
expects the following context keys:
fileFormat
in
- Mandatory path(s) to source file on disk.
- This can be a string path to a single file, or a glob, or a list of paths and/or globs.
- Each path can be a glob, a relative or absolute path.
out
- Write output file to here. Will create directories in path if these do not exist already.
out
is optional. If not specified, will edit thein
files in-place.- If in-path refers to >1 file (e.g it’s a glob or list), out path can only be a directory - it doesn’t make sense to write >1 file to the same single file output (this is not an appender.)
- To ensure out path means a directory and not a file,
be sure to have the os’ path separator at the end (
/
on a sensible filesystem) - If you specify an
out
directory without a file-name, out files will have the same name they had inin
.
multiple files & globs
You can format multiple files in the same step. This works in two ways:
in
is glob like./**/myfile.arb
fileFormat:
# ** recurses sub-dirs per usual globbing
in: ./testfiles/sub3/**/*.txt
# note the dir separator at the end.
# since >1 in files, out can only be a dir.
out: ./out/replace/
in
is a list of paths and/or globs.
fileFormat:
in:
# ** recurses sub-dirs per usual globbing
- ./testfiles/sub3/**/*.txt
- ./testfiles/??b/fileformat-in.*.txt
- ./testfiles/static.file
# note the dir separator at the end.
# since >1 in files, out can only be a dir.
out: ./out/replace/
in-place edit
If you do not specify out
, pypyr will over-write (i.e edit in-place) all the
files you specify in in
.
fileFormat:
# in-place edit/overwrite all the files in. this can also be a glob, or
# a mixed list of paths and/or globs.
in: ./infile.txt
substitutions on paths
The file in and out paths support substitutions, which allows you to specify paths dynamically.
- name: pypyr.steps.set
comment: set some arb values in context
in:
set:
myfilename: input-file
myoutputfile: out/output.txt
- name: pypyr.steps.fileformat
comment: you can set in & out entirely or partially with formatting expressions
in:
fileFormat:
in: testfiles/{myfilename}.ext
out: '{myoutputfile}'
encoding
By default in
will read and out
will write in the platform’s default
encoding. This is utf-8
for most systems, but be aware on Windows it’s still
cp1252
.
You can use the encoding
input explicitly to set the encoding:
- name: pypyr.steps.fileformat
comment: set encoding
in:
fileFormat:
in: testfiles/infile.txt
out: testfiles/outfile.txt
encoding: utf-8
You can also individually set the encoding for in
and out
. This allows you
to convert a file from one encoding to another:
- name: pypyr.steps.fileformat
comment: set encoding
in:
fileFormat:
in: testfiles/infile.txt
out: testfiles/outfile.txt
encodingIn: ascii
encodingOut: utf-16
All of these are optional - if you do not explicitly over-ride the encoding for
either in
or out
, pypyr will just use the system default.
See here for more details on handling text encoding in pypyr and changing the defaults.
See here for a list of available encoding codecs.