HDFS, as the name implies, is a distributed file system. It stores a file across a cluster of commodity servers.
It was designed to store and provide fast access to big files and large datasets. It is scalable and fault tolerant.
HDFS is a block-structured file system. Just like Linux file systems, HDFS splits a file into fixed-size
blocks, also known as partitions or splits. The default block size is 128 MB, but it is configurable. Given the large block size, it should be clear that HDFS is not designed for storing small files. Where possible, HDFS spreads the blocks of a file across different machines. An application can therefore parallelize read and write operations on a file, making it much faster to read or write a large HDFS file spread across many disks on different computers than to read or write a file of the same size stored on a single disk.
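The block layout of a file can be inspected programmatically. The following is a minimal sketch using the Hadoop Java client (the FileSystem API); it assumes a reachable HDFS cluster whose configuration files are on the classpath, and the file path /data/large-file.txt is hypothetical, used only for illustration.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.BlockLocation;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class BlockInfo {
  public static void main(String[] args) throws Exception {
    // Picks up fs.defaultFS and related settings from core-site.xml/hdfs-site.xml
    // on the classpath; assumes the client is configured to reach an HDFS cluster.
    Configuration conf = new Configuration();
    FileSystem fs = FileSystem.get(conf);

    // Hypothetical file path used for illustration.
    Path file = new Path("/data/large-file.txt");
    FileStatus status = fs.getFileStatus(file);

    // Block size used for this file (128 MB by default, configurable via dfs.blocksize).
    System.out.println("Block size: " + status.getBlockSize() + " bytes");

    // Each BlockLocation lists the hosts that store a replica of that block.
    BlockLocation[] blocks = fs.getFileBlockLocations(status, 0, status.getLen());
    for (BlockLocation block : blocks) {
      System.out.println("Offset " + block.getOffset()
          + ", length " + block.getLength()
          + ", hosts " + String.join(",", block.getHosts()));
    }
    fs.close();
  }
}
```

Printing the block locations of a large file makes the earlier point concrete: successive blocks typically report different sets of hosts, which is what allows reads and writes to proceed in parallel across machines.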
Distributing a file across multiple machines increases the risk of the file becoming unavailable if one of those machines fails. HDFS mitigates this risk by replicating each file block on multiple machines; the default replication factor is 3. Thus, even if one or two of the machines storing a block fail, the file can still be read. HDFS was designed with the assumption that machines fail on a regular basis, so it can handle the failure of one or more machines in a cluster.
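The replication factor can also be read and changed per file through the same FileSystem API. The sketch below makes the same assumptions as the previous one (cluster configuration on the classpath, hypothetical path /data/large-file.txt); the cluster-wide default comes from the dfs.replication property.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class ReplicationInfo {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    FileSystem fs = FileSystem.get(conf);

    // Hypothetical file path used for illustration.
    Path file = new Path("/data/large-file.txt");

    // Report the current replication factor (3 unless dfs.replication was changed).
    FileStatus status = fs.getFileStatus(file);
    System.out.println("Replication factor: " + status.getReplication());

    // Request a higher replication factor for this file; HDFS re-replicates
    // the blocks in the background.
    fs.setReplication(file, (short) 5);

    fs.close();
  }
}
```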