[FEATURE] escape user provided arguments appropriately for the platform #2
Description
What / Why
Currently we use JSON.stringify()
on each provided user argument as an attempt to escape them, this causes a significant number of issues however. Notably passing literal bash variable names (i.e. npm run foo -- 'echo $1'
) does not work correctly, we also fail to correctly escape sequences like \n
.
In order to resolve this, we plan to reintroduce puka.
These changes are inspired by the work of @lydell, who has kindly offered their continued collaboration on this problem, in this pull request, which was an excellent start but identified some problems with how puka does this escaping currently. Due to the lack of shebang support in Windows we place batch files (foo.cmd
) for bins installed by npm rather than symlinks as we do for Linux/OSX. This shim requires an extra level of escaping to ensure the correct string is received by the node package being run. While puka currently does this double escaping correctly, it causes a problem when the command portion of a script is not a batch file (i.e. node.exe
) where we only want one level of escaping.
The author of puka, @rhendric, has very kindly offered to help us with a solution for this by attempting to determine if the command being run is a batch script or not and adjusting the levels of escaping appropriately for Windows users.
The posix shell escaping appears to be sufficient in puka as it is today and does not appear to need any modification.
This issue will serve as the focal point for communication between the npm team, @rhendric and @lydell while they collaborate with us to get this implemented.
Implementation
As is implemented and discussed in the original run-script pull request, we will apply this escaping only to the user provided additional arguments. We will not attempt to coerce a script declared in a package.json to run on a platform other than the one it was designed for.
We have identified a condition matrix that we believe will give us a starting point:
- shell type:
- posix sh-like (any differences between bash vs sh vs zsh?)
- windows cmd (regular old binary)
- windows cmd (batch file)
- windows powershell? (aside: maybe we could fix some stuff by using powershell instead of bash? or maybe that'd make things worse? idk, but it is ubiquitous now.)
- command being run:
npx some-package some args
npm exec some-package some args
npx --package some-package some command
npm exec --package some-package some-command
npx --package some-package -c 'some-command'
npm exec --package some-package -c 'some-command'
npm run foo
(wherescripts.foo
issome-command some-args
)npm run foo -- some-args
- funkiness of args (all of these are the unquoted actual string value in
<code>
block, so if you see'
, then it has'
chars in it)$1
(ie: get literal $1 as the arg)"$1"
(arg has double-quotes)'$1'
(arg has single-quotes)\$1
--arg="$1"
--arg=npm exec -c "$1"
--arg=npm exec -c '$1'
'--arg=npm exec -c "$1"'
For each item in this matrix, we should identify the expected result, the current result, and any breakage that may be caused by replacing the current result with the expected one. Ideally this same matrix will be used to build a suite of tests that will help us ensure that we are doing the correct thing and we do not introduce regressions.