Excel
Excel is a widely used spreadsheet application for data analysis and statistics. ClickPipes can read Excel files stored locally or on FTP, SFTP, SMB, S3FS, or OSS to meet a variety of data flow needs.
This article describes how to connect to an Excel data source in ClickPipes.
Supported File Formats
ClickPipes supports both the XLS and XLSX formats, including sheets that contain merged cells and formulas.
Precautions
- Each connection refers to a single model (table). If there are multiple models, create a separate connection for each one.
- Excel data sources are only available for data transformation tasks.
- To avoid excessive memory usage on the machine that reads the files, it is recommended that each Excel file does not exceed 512 MB.
- By default, ClickPipes checks the Excel files for changes (such as added or modified files) every minute. When changes are detected, ClickPipes synchronizes all files again so that the update condition field is applied accurately. Deletions of files or data are not synchronized.
- Data types that can be used in Excel files include BOOLEAN, DATE, DOUBLE, STRING, and TEXT.
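The format and size precautions above can be checked before a file is placed in the source location. The following is a minimal sketch, not part of ClickPipes: the function name `check_excel_file` is hypothetical, and it assumes that "512 MB" means 512 × 1024 × 1024 bytes and that the file extension reflects the actual format.

```python
from pathlib import Path

# Assumption: "512 MB" is interpreted as 512 * 1024 * 1024 bytes.
MAX_BYTES = 512 * 1024 * 1024
# Formats listed under "Supported File Formats".
SUPPORTED_SUFFIXES = {".xls", ".xlsx"}


def check_excel_file(path, size_bytes=None):
    """Return a list of warnings for a candidate Excel file (empty if none).

    Hypothetical helper for illustration only. If size_bytes is not given,
    the size is read from the local file system.
    """
    p = Path(path)
    warnings = []
    if p.suffix.lower() not in SUPPORTED_SUFFIXES:
        warnings.append(f"unsupported extension: {p.suffix or '(none)'}")
    if size_bytes is None:
        size_bytes = p.stat().st_size
    if size_bytes > MAX_BYTES:
        warnings.append(f"file is {size_bytes} bytes, above the recommended 512 MB")
    return warnings
```

An empty list means the file passes both checks. The check only covers the extension and size; it does not inspect the data types used inside the workbook.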
Preparations
Before creating the connection, complete the preparations for the location where the Excel files are stored, such as authorizing an account with access to the files. This helps ensure a smooth and secure connection.
Follow the instructions that match the location of the Excel file.
Stored on Local/FTP/SFTP/SMB
Stored on Amazon S3
Stored on OSS
Source Config
:::tip
For more about how to obtain the information needed for each file protocol, see Preparations.
:::
* **Connection name**: Enter a unique name that has business significance.
* **Connection type**: Excel is currently supported only as a **Source**.
* **File Protocol**: Select one of the following protocols based on the location of the Excel files. This article uses **S3FS** (Amazon S3 bucket) to illustrate the process.
  * **FTP** (File Transfer Protocol): The file is stored on an FTP server. After selecting this item, you also need to fill in the address, port, username, password, file path, and other information of the FTP server.
  * **SFTP** (Secure File Transfer Protocol): The file is stored on an SFTP server. After selecting this item, you also need to fill in the address, port, username, password, file path, and other information of the SFTP server.
  * **SMB** (Server Message Block Protocol): The file is stored on an SMB server. SMB versions 1.x, 2.x, and 3.x are supported. After selecting this item, you also need to fill in the address, username, password, file path, and other information of the SMB server.
  * **S3FS** (file system based on the S3 protocol): The file is stored in an Amazon S3 bucket. After selecting this item, you need to fill in the Accesskey, Secretkey, Endpoint (fixed at **s3.amazonaws.com**), Bucket, and file path.
  * **OSS** (Object Storage Service): The file is stored in Alibaba Cloud Object Storage Service. After selecting this item, you need to fill in the Accesskey, Secretkey, Endpoint, Bucket, and file path.
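To gather the right values before opening the connection form, the per-protocol settings listed above can be summarized as a small checklist. The following is an illustrative sketch only: the dictionary keys (`address`, `file_path`, and so on) and the function `missing_fields` are hypothetical names chosen for this example, not ClickPipes configuration keys or APIs, and the lists cover only the fields named in this article (the form may ask for other information as well).

```python
# Fields named in this article for each file protocol.
# Key names are hypothetical and used only for this checklist.
REQUIRED_FIELDS = {
    "FTP": ["address", "port", "username", "password", "file_path"],
    "SFTP": ["address", "port", "username", "password", "file_path"],
    "SMB": ["address", "username", "password", "file_path"],
    "S3FS": ["accesskey", "secretkey", "endpoint", "bucket", "file_path"],
    "OSS": ["accesskey", "secretkey", "endpoint", "bucket", "file_path"],
}


def missing_fields(protocol, settings):
    """Return the fields for `protocol` that are absent or empty in `settings`."""
    required = REQUIRED_FIELDS[protocol]
    return [name for name in required if settings.get(name) in (None, "")]


# Example: settings prepared for an Amazon S3 bucket (placeholder values).
s3_settings = {
    "accesskey": "<your-access-key>",
    "secretkey": "<your-secret-key>",
    "endpoint": "s3.amazonaws.com",  # fixed value for S3FS per this article
    "bucket": "<your-bucket>",
    "file_path": "<path-to-excel-files>",
}
s3_missing = missing_fields("S3FS", s3_settings)  # empty list: nothing missing
```

Keeping the checklist as plain data makes it easy to adjust if a protocol requires additional fields in your environment.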