alien.py - Python interface to websocket endpoint of ALICE Grid Services
Quick containerized testing:
singularity run library://adriansev/default/alienpy:latest [cmd]
or
singularity run oras://registry.cern.ch/asevcenc/alienpy:latest [cmd]
latest
usually would point to master but not always.(if desired and needed, request by email a new latest tag)
see here what tags are available and their dates of creation.
The docker images can be found @DockerHub
docker run -it \
--user $(id -u):$(id -g) \
--workdir="/home/$USER" \
--env TMPDIR="${TMPDIR:-/tmp}" \
--volume="/etc/group:/etc/group:ro" \
--volume="/etc/passwd:/etc/passwd:ro" \
--volume="/etc/shadow:/etc/shadow:ro" \
--volume="/home:/home" \
adriansevcenco/alienpy.dock:latest [cmd]
if no cmd is passed, the shell form will start
Basic usage
Can be used as command mode and interactive mode :
-
Command mode :
alien.py <command>
e.g :
alien.py pwd
N.B. command/arguments must be quoted to avoid being interpreted by the shell:
alien.py 'rm my_alien_dir/*'
-
Interactive/shell mode e.g :
alien.py
Welcome to the ALICE GRID
support mail: adrian.sevcenco@cern.ch
AliEn[asevcenc]:/alice/cern.ch/user/a/asevcenc/ >pwd
/alice/cern.ch/user/a/asevcenc/
AliEn[asevcenc]:/alice/cern.ch/user/a/asevcenc/ >whoami
asevcenc
AliEn[asevcenc]:/alice/cern.ch/user/a/asevcenc/ >
- For both command and shell mode multiple commands can be issued separated by
;
- The interactive mode save the command history in
${HOME}/.alienpy_history
and it can be navigated with Up/Down keys !
is understood as running into shell whatever command follows|
pipe whatever output of AliEn command to a shell command (that follows after the first(only the first) |
)
Environment steering
There are a few environment variables that influence the mechanics of the script :
- JALIEN_TOKEN_CERT, JALIEN_TOKEN_KEY - will overwrite the defaults; they are either full path certificate,key token files OR their respective contents
- If set the following will be honored: X509_USER_CERT, X509_USER_KEY, X509_CERT_DIR or X509_CERT_FILE
- ALIENPY_TIMEOUT will change the interval for keep-alive mechanics.
- ALIENPY_CONNECT_TRIES - default = 3 : number of connect trials
- ALIENPY_CONNECT_TRIES_INTERVAL - default = 0.5 : seconds between connection trials
For debugging purposes there are a few environment toggles :
- ALIENPY_JSON - print the unprocessed json message from the server
- ALIENPY_JSONRAW - print the unprocessed byte stream message from the server
- ALIENPY_JCENTRAL - it will connect to this server, ignoring any other options
- ALIENPY_NO_STAGGER - disable staggered parallel host resolution and socket creation (see RFC8305)
- ALIENPY_DEBUG - detailed debug meesages will be found in ALIENPY_DEBUG_FILE
- ALIENPY_DEBUG_FILE - set the location of log file
- ALIENPY_DEBUG_APPEND - is set the output will be appended to the present log file. if not the file will be overwritten.
- ALIENPY_TIMECONNECT - if set will report time for websocket creation - e.g.
ALIENPY_TIMECONNECT=1 alien.py pwd
- ALIENPY_TIMING - report detailed operation timing in the log file.
DEBUG file copy operations:
- ALIENPY_KEEP_META - keep the metafile generated for download operations. Can be directly used with xrdcp.
- XRD_LOGLEVEL='Dump'
- XRD_LOGFILE=xrdlog.txt
See also the native XRootD environment toggles: docs
Authentication
The authentication process needs the presence of a X509 certificate (enrolled into ALICE VO, see here)
and of a CA certificates directory for verification.
The default CA location that will be searched is within alice.cern.ch cvmfs repository
If not found, the CApath will default to /etc/grid-security/certificates
If these locations are not available, one must set X509_CERT_DIR to a valid location
Command usage and examples
The list of available commands can seen with: ?
or help
Command help can be listed with: ? command
, help command
, command -h
Storage related operations
This section refer to any copy to/from grid or file interactions.
cat/more/less
will download the target lfn to a temporary file and will act upon it while
vi/nano/mcedit/edit
will, after the modification of downloaded temporary, backup the existing lfn, and upload the modified file
The target file upload can support grid specifiers like those described in cp
command e.g. edit my_file@disk:2,SE1
cp
option
cp can take as arguments both files and directories and have the following options:
alien.py cp -h
Command format is of the form of (with the strict order of arguments):
cp <options> src dst
or
cp <options> -input input_file
location prefixes are: file: | file:// | alien: | alien://
if one prefix is specified the other operator is considered of the other kind (no local -> local, or grid->grid operations allowed)
if no prefix is specified, the src will be _first_ checked if local and then if remote.
-input argument is a file with >src dst< pairs
after each src,dst can be added comma separated specifiers in the form of: @disk:N,SE1,SE2,!SE3
where disk selects the number of replicas and the following specifiers add (or remove) storage endpoints from the received list
options are the following :
-h : print help
-f : replace destination file (if destination is local it will be replaced only if integrity check fails)
-P : enable persist on successful close semantic
-cksum : check hash sum of the file; for downloads the central catalogue md5 will be verified;
for uploads (for xrootd client > 4.12.0) a hash type will be negociated with remote and transfer will be validated
-y <nr_sources> : use up to the number of sources specified in parallel (N.B. Ignored as it breaks download of files stored in archives)
-S <aditional TPC streams> : uses num additional parallel streams to do the transfer. (max = 15)
-chunks <nr chunks> : number of chunks that should be requested in parallel
-chunksz <bytes> : chunk size (bytes)
-T <nr_copy_jobs> : number of parralel copy jobs from a set (for recursive copy); defaults to 8 for downloads
-noxrdzip: circumvent the XRootD mechanism of zip member copy and download the archive and locally extract the intended member.
N.B.!!! for recursive copy (all files) the same archive will be downloaded for each member.
If there are problems with native XRootD zip mechanism, download only the zip archive and locally extract the contents
for the recursive copy of directories the following options (of the find command) can be used:
-glob <globbing pattern> : this is the usual AliEn globbing format; N.B. this is NOT a REGEX!!! defaults to all "*"
-select <pattern> : select only these files to be copied; N.B. this is a REGEX applied to full path!!!
-name <pattern> : select only these files to be copied; N.B. this is a REGEX applied to a directory or file name!!!
-name <verb>_string : where verb = begin|contain|ends|ext and string is the text selection criteria.
verbs are aditive : -name begin_myf_contain_run1_ends_bla_ext_root
N.B. the text to be filtered cannont have underline <_> within!!!
-parent <parent depth> : in destination use this <parent depth> to add to destination ; defaults to 0
-a : copy also the hidden files .* (for recursive copy)
-j <queue_id> : select only the files created by the job with <queue_id> (for recursive copy)
-l <count> : copy only <count> nr of files (for recursive copy)
-o <offset> : skip first <offset> files found in the src directory (for recursive copy)
.
, ..
are interpreted for all grid names (lfns)
%ALIEN
is converted to user AliEn home directory
lfns that don't start with a /
will have the current directory appended before being processed
Miscellaneous
The shell prompt
It can show date and/or local directory:
prompt date
will toggle on/off the dateprompt pwd
will toggle on/off the local current directory
For permanent setting the following are env variables are available : ALIENPY_PROMPT_DATE, ALIENPY_PROMPT_CWD
AliEn[asevcenc]:/alice/cern.ch/user/a/asevcenc/ >prompt date
2020-02-07T16:49:05 AliEn[asevcenc]:/alice/cern.ch/user/a/asevcenc/ >
AliEn[asevcenc]:/alice/cern.ch/user/a/asevcenc/ >prompt pwd
AliEn[asevcenc]:/alice/cern.ch/user/a/asevcenc/ local:/home.hdd/adrian/work-GRID/jalien_py >
CWD persistence
Default behaviour is to save (and then restore) the last used CWD.
This bevahiour can be disabled with the env var ALIENPY_NO_CWD_RESTORE
ls
aliases
ll
, la
, lla
are aliases to ls -l
, ls -a
, ls -la
Custom aliases
A fixed file ${HOME}/.alienpy_aliases
can be used to define alias=string pairs that will be used(translated) in the usage of alien.py. One can do myalias=cmd1;cmd2;cmd3
and the myalias
string will be replaced by it's value when used.
Python shell
term
command will open an Python shell within the context of alien.py and with a session object loaded
API usage
see examples directory