Master corpora are the core of the engine. Globalese uses them as the reference when training the engine: segment pairs from the auxiliary and/or stock corpora that belong to the same domain as the master corpus are given a higher weight, and all other segment pairs are given a lower weight.
Auxiliary corpora, just like stock corpora, are used to enrich the master corpora. A bigger pool of auxiliary corpora gives the training process a bigger selection base.
Only the content most closely related to the master corpora will eventually be used for training the engine, so feel free to add any material that has good linguistic value.
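To make the idea of domain-based weighting more concrete, here is a minimal Python sketch. It is not Globalese's actual algorithm, which is not described here. It assumes a simple vocabulary-overlap score as a stand-in for "same domain", and the function names, the threshold and the weight values are all hypothetical.

```python
# Illustrative sketch only: NOT Globalese's implementation.
# Assumption: "same domain" is approximated by how many of a segment's
# source tokens also occur in the master corpus.

def tokenize(text):
    """Lowercase and split on whitespace; returns a set of tokens."""
    return set(text.lower().split())

def build_vocabulary(segments):
    """Collect all source-side tokens from (source, target) segment pairs."""
    vocab = set()
    for source, _target in segments:
        vocab |= tokenize(source)
    return vocab

def domain_score(source, master_vocab):
    """Fraction of the segment's tokens that also appear in the master corpus."""
    tokens = tokenize(source)
    if not tokens:
        return 0.0
    return len(tokens & master_vocab) / len(tokens)

def weight_segments(master, auxiliary, threshold=0.5, high=1.0, low=0.2):
    """Give in-domain auxiliary segments a higher weight than the rest.

    threshold, high and low are hypothetical values chosen for the example.
    """
    master_vocab = build_vocabulary(master)
    weighted = []
    for source, target in auxiliary:
        score = domain_score(source, master_vocab)
        weight = high if score >= threshold else low
        weighted.append((source, target, weight))
    return weighted

master = [
    ("install the printer driver", "..."),
    ("restart the printer", "..."),
]
auxiliary = [
    ("restart the driver", "..."),
    ("the weather is sunny today", "..."),
]

for source, _target, weight in weight_segments(master, auxiliary):
    print(f"{weight:.1f}  {source}")
```

In this toy example the first auxiliary segment shares all of its words with the master corpus and receives the higher weight, while the second shares only one word and receives the lower weight. A real system would use a far more robust similarity measure than word overlap.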