From 78c41cbb550e7cac647eb5b6f3f345186d43dccb Mon Sep 17 00:00:00 2001 From: Manish R Jain <manishrjain@gmail.com> Date: Thu, 14 Apr 2016 12:23:00 +1000 Subject: [PATCH] Note about merging --- README.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index af28e1df..3470af0c 100644 --- a/README.md +++ b/README.md @@ -163,8 +163,10 @@ $ go build . && ./uidassigner --numInstances 2 --instanceIdx 1 --rdfgzips $BENCH Once the shards are generated, you need to merge them before the second pass. If you ran this as a single instance, merging isn't required. ``` $ cd $GOPATH/src/github.com/dgraph-io/dgraph/tools/merge -$ go build . && ./merge --stores ~/dgraph/uids --dest ~/dgraph/final.uids +$ go build . && ./merge --stores ~/dgraph/uids --dest ~/dgraph/uasync.final ``` +The above command would iterate over all the directories in `~/dgraph/uids`, and merge their data into one `~/dgraph/uasync.final`. +Note that this merge step is important if you're generating multiple uid intances, because all the loader instances need to have access to global uids list. #### Second Pass: Data Loader Now that we have assigned UIDs for all the entities, the data is ready to be loaded. -- GitLab