With a wildcard, the input path becomes:
/user/inputLocation/*
For nested partitions, globs and wildcards can be combined, for example:
/user/inputLocation/{Partition-1,Partition-2,Partition-3,...,Partition-n}/*
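Pig reads the data from the matching locations without difficulty and optimizes the operations as they execute. That leaves unavailable cluster resources as the only obstacle Pig faces. Beyond that, when the data needs heavy transformation, Apache Pig is very hard to beat.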
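To make the path patterns above concrete, here is a minimal Python sketch of how a brace-plus-wildcard pattern selects partition directories. It only illustrates the matching rules and is not how Pig or Hadoop implement them. The helper names expand_braces and matches and the part-00000 file names are hypothetical, chosen for the example.

```python
import fnmatch
import re


def expand_braces(pattern):
    """Expand every {a,b,c} group into a list of brace-free patterns."""
    match = re.search(r"\{([^{}]*)\}", pattern)
    if match is None:
        return [pattern]
    head, tail = pattern[:match.start()], pattern[match.end():]
    expanded = []
    for option in match.group(1).split(","):
        expanded.extend(expand_braces(head + option + tail))
    return expanded


def matches(path, pattern):
    """Compare segment by segment so that * never crosses a '/'."""
    path_parts = path.strip("/").split("/")
    for candidate in expand_braces(pattern):
        cand_parts = candidate.strip("/").split("/")
        if len(cand_parts) == len(path_parts) and all(
            fnmatch.fnmatchcase(p, c) for p, c in zip(path_parts, cand_parts)
        ):
            return True
    return False


# Pattern taken from the text, limited to two partitions for the example.
pattern = "/user/inputLocation/{Partition-1,Partition-2}/*"

# Hypothetical files; the part-00000 names are assumptions.
paths = [
    "/user/inputLocation/Partition-1/part-00000",
    "/user/inputLocation/Partition-2/part-00000",
    "/user/inputLocation/Partition-3/part-00000",
]

selected = [p for p in paths if matches(p, pattern)]
print(selected)
```

Only the files under Partition-1 and Partition-2 are selected. Partition-3 is skipped because it is not listed inside the braces, and a file one level deeper would not match either, since the trailing * covers a single path segment.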
Hive vs. Pig
The following information explains the design approach and working principles behind Hive and Pig.