Data vs Clients Nodes

Hadoop needs to use its FileSystem remotely from client nodes as well as directly on
data nodes. Client nodes are responsible for basic file system operations as well as
accessing data nodes remotely. Usually, client nodes are started together
with job-submitter or job-scheduler processes, while data nodes are usually
started together with Hadoop task-tracker processes.