You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+52-3Lines changed: 52 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -102,7 +102,7 @@ What this does:
102
102
- keeps raw InterProScan TSV outputs inside `dnmb_interproscan/`
103
103
- on Linux, runs as your current host UID/GID in the direct `docker run` examples so output and cache files stay writable by you
104
104
- on arm64 hosts, add `--platform linux/amd64` to the direct `docker run` command because the published image is currently `amd64`
105
-
- lets you control module selection through `DNMB_MODULES`, `DNMB_SKIP_MODULES`, `DNMB_MODULE_CPU`, `DNMB_PROPHAGE_BACKEND`, and `DNMB_CLEAN_PREVIOUS`
105
+
- lets you control module selection through `DNMB_MODULES`, `DNMB_SKIP_MODULES`, `DNMB_MODULE_CPU`, `DNMB_PROPHAGE_BACKEND`, `DNMB_CLEAN_PREVIOUS`, `DNMB_COMPARATIVE`, and `DNMB_COMPARATIVE_DATA_ROOT`
106
106
107
107
## Optional Shell Launcher
108
108
@@ -343,6 +343,56 @@ the per-genome pipeline finishes (it calls each plotter below against
343
343
`dirname(getwd())`, or `comparative_data_root` when supplied). The
344
344
individual plotters stay useful when you want a subset or custom colors.
345
345
346
+
Run the comparative suite directly with Docker, without opening R, after
347
+
per-genome DNMB runs have already produced one genome folder per GenBank file:
348
+
349
+
```text
350
+
data/
351
+
├── genome_1/
352
+
│ └── input_1.gbff
353
+
├── genome_2/
354
+
│ └── input_2.gbff
355
+
└── genome_3/
356
+
└── input_3.gbff
357
+
```
358
+
359
+
From the parent directory that contains `data/`:
360
+
361
+
```bash
362
+
docker run --rm \
363
+
--platform linux/amd64 \
364
+
--user "$(id -u):$(id -g)" \
365
+
-e DNMB_MODULE_CPU=8 \
366
+
-v "$PWD/data:/data" \
367
+
-v "$HOME/.dnmb-cache:/opt/dnmb/cache" \
368
+
ghcr.io/jaeyoonsung/dnmbsuite:latest \
369
+
comparative /data
370
+
```
371
+
372
+
This writes comparative PDFs and count matrices under:
373
+
374
+
```text
375
+
data/comparative/
376
+
```
377
+
378
+
To run a per-genome DNMB analysis and render the comparative suite at the end
379
+
from a focal genome folder, use:
380
+
381
+
```bash
382
+
cd /path/to/data/genome_1
383
+
384
+
docker run --rm \
385
+
--platform linux/amd64 \
386
+
--user "$(id -u):$(id -g)" \
387
+
-e DNMB_COMPARATIVE=1 \
388
+
-e DNMB_COMPARATIVE_DATA_ROOT=/data \
389
+
-e DNMB_MODULE_CPU=8 \
390
+
-v "$(dirname "$PWD"):/data" \
391
+
-v "$HOME/.dnmb-cache:/opt/dnmb/cache" \
392
+
ghcr.io/jaeyoonsung/dnmbsuite:latest \
393
+
/data/$(basename "$PWD")
394
+
```
395
+
346
396
After per-genome DNMB runs finish, render across-genome heatmaps for
347
397
defense-module families as well as enzyme/CAZyme modules. Each plotter
348
398
treats every GenBank-bearing subfolder of `data_root` as one genome,
@@ -392,7 +442,7 @@ Pass `auto_run_missing = FALSE` to render only what already exists.
392
442
to the direct `docker run` command after creating that host folder once with:
0 commit comments