catalog_service_port (int32) |
port where the CatalogService is running |
26000 |
26000 |
false |
catalog_topic_mode (string) |
The type of data that the catalog service will publish into the Catalog StateStore topic. Valid values are 'full', 'mixed', or 'minimal'.
In 'full' mode, complete catalog objects are published any time a new version is available. In 'minimal' mode, only a minimal object is published when a new version of a catalog object is available. In 'mixed' mode, both types of topic entries are published.
When all impalad coordinators are configured with --use_local_catalog disabled (the default), 'full' mode should be used. If all impalad coordinators are configured with --use_local_catalog enabled, 'minimal' mode should be used. When some impalads are configured with --use_local_catalog disabled and others configured with it enabled, then 'mixed' mode is required. |
full |
full |
false |
initial_hms_cnxn_timeout_s (int32) |
Number of seconds catalogd will wait to establish an initial connection to the HMS before exiting. |
120 |
120 |
false |
load_catalog_in_background (bool) |
If true, loads catalog metadata in the background. If false, metadata is loaded lazily (on access). |
false |
false |
false |
max_hdfs_partitions_parallel_load (int32) |
(Advanced) Number of threads used to load block metadata for HDFS based partitioned tables. Due to HDFS architectural limitations, it is unlikely to get a linear speed up beyond 5 threads. |
5 |
5 |
false |
max_nonhdfs_partitions_parallel_load (int32) |
(Advanced) Number of threads used to load block metadata for tables that do not support the notion of blocks/storage IDs. Currently supported for S3/ADLS. |
20 |
20 |
false |
num_metadata_loading_threads (int32) |
(Advanced) The number of metadata loading threads (degree of parallelism) to use when loading catalog metadata. |
16 |
16 |
false |
sentry_catalog_polling_frequency_s (int64) |
Frequency (in seconds) at which the the catalogd polls the sentry service to update any policy changes. |
60 |
60 |
false |
sentry_config (string) |
Local path to a sentry-site.xml configuration file. If set, authorization will be enabled. |
|
|
false |
asm_module_dir (string) |
if set, saves disassembly for generated IR modules to the specified directory. |
|
|
false |
disable_optimization_passes (bool) |
if true, disables llvm optimization passes (used for testing) |
false |
false |
false |
dump_ir (bool) |
if true, output IR after optimization passes |
false |
false |
false |
opt_module_dir (string) |
if set, saves optimized generated IR modules to the specified directory. |
|
|
false |
perf_map (bool) |
if true, generate /tmp/perf-<pid>.map file for linux perf symbols. This is not recommended for production use because it may affect performance. |
false |
false |
false |
print_llvm_ir_instruction_count (bool) |
if true, prints the instruction counts of all JIT'd functions |
false |
false |
false |
unopt_module_dir (string) |
if set, saves unoptimized generated IR modules to the specified directory. |
|
|
false |
abort_on_config_error (bool) |
Abort Impala startup if there are improper configs or running on unsupported hardware. |
true |
true |
false |
be_port (int32) |
port on which thrift based ImpalaInternalService is exported |
22000 |
22000 |
false |
be_principal (string) |
Kerberos principal for backend network connections only,overriding --principal if set. Must not be set if --principal is not set. |
|
|
false |
buffer_pool_clean_pages_limit (string) |
(Advanced) Limit on bytes of clean pages that will be accumulated in the buffer pool. Specified as number of bytes ('<int>[bB]?'), megabytes ('<float>[mM]'), gigabytes ('<float>[gG]'), or percentage of the buffer pool limit ('<int>%'). Defaults to bytes if no unit is given.. |
10% |
10% |
false |
buffer_pool_limit (string) |
(Advanced) Limit on buffer pool size. Specified as number of bytes ('<int>[bB]?'), megabytes ('<float>[mM]'), gigabytes ('<float>[gG]'), or percentage of the process memory limit (minus the JVM heap if --mem_limit_includes_jvm is true) ('<int>%'). Defaults to bytes if no unit is given. The default value and behaviour of this flag may change between releases. |
85% |
85% |
false |
compact_catalog_topic (bool) |
If true, catalog updates sent via the statestore are compacted before transmission. This saves network bandwidth at the cost of a small quantity of CPU time. Enable this option in cluster with large catalogs. It must be enabled on both the catalog service, and all Impala demons. |
true |
true |
false |
disable_admission_control (string) |
Removed |
__UNSET__ |
false |
true |
disable_kudu (bool) |
If true, Kudu features will be disabled. |
false |
false |
false |
enable_minidumps (bool) |
Whether to enable minidump generation upon process crash or SIGUSR1. |
true |
true |
false |
enable_process_lifetime_heap_profiling (bool) |
(Advanced) Enables heap profiling for the lifetime of the process. Profile output will be stored in the directory specified by -heap_profile_dir. Enabling this option will disable the on-demand/remote server profile handlers. |
false |
false |
false |
enable_rm (string) |
Removed |
__UNSET__ |
false |
true |
enable_stats_extrapolation (bool) |
If true, uses table statistics computed with COMPUTE STATS to extrapolate the row counts of partitions. |
false |
false |
false |
heap_profile_dir (string) |
Output directory to store heap profiles. If not set profiles are stored in the current working directory. |
|
|
false |
hms_event_polling_interval_s (int32) |
Configure catalogd to invalidate cached table metadata based on metastore events. These metastore events could be generated by external systems like Apache Hive or a different Impala cluster using the same Hive metastore server as this one. A non-zero value of this flag sets the polling interval of catalogd in seconds to fetch new metastore events. A value of zero disables this feature. When enabled, this flag has the same effect as "INVALIDATE METADATA" statement on the table for certain metastore event types. Additionally, in case of events which detect creation or removal of objects from metastore, catalogd adds or removes such objects from its cached metadata. This feature is independent of time and memory based automatic invalidation of tables. Note that this is still an experimental feature and not recommended to be deployed on production systems until it is made generally available. |
0 |
0 |
false |
hostname (string) |
Hostname to use for this daemon, also used as part of the Kerberos principal, if enabled. If not set, the system default will be used |
|
hadoop001 |
false |
inc_stats_size_limit_bytes (int64) |
Maximum size of incremental stats the catalog is allowed to serialize per table. This limit is set as a safety check, to prevent the JVM from hitting a maximum array limit of 1GB (or OOM) while building the thrift objects to send to impalads. By default, it's set to 200MB |
209715200 |
209715200 |
false |
invalidate_tables_on_memory_pressure (bool) |
Configure catalogd to invalidate recently unused tables when the old GC generation is almost full. This is independent from time-based invalidation configured by invalidate_table_timeout_s. To enable this feature, a true flag must be applied to both catalogd and impalad. |
false |
false |
false |
invalidate_tables_timeout_s (int32) |
If a table has not been referenced in a SQL statement for more than the configured amount of time, the catalog server will automatically evict its cached metadata about this table. This has the same effect as a user-initiated "INVALIDATE METADATA" statement on the table. Configuring this to 0 disables time-based automatic invalidation of tables. This is independent from memory-based invalidation configured by invalidate_tables_on_memory_pressure. To enable this feature, a non-zero flag must be applied to both catalogd and impalad. |
0 |
0 |
false |
keytab_file (string) |
Absolute path to Kerberos keytab file |
|
|
false |
krb5_ccname (string) |
Absolute path to the file based credentials cache that we pass to the KRB5CCNAME environment variable. |
/tmp/krb5cc_impala_internal |
/tmp/krb5cc_impala_internal |
false |
krb5_conf (string) |
Absolute path to Kerberos krb5.conf if in a non-standard location. Does not normally need to be set. |
|
|
false |
krb5_debug_file (string) |
Turn on Kerberos debugging and output to this file |
|
|
false |
krpc_port (int32) |
port on which KRPC based ImpalaInternalService is exported |
27000 |
27000 |
false |
kudu_client_rpc_timeout_ms (int32) |
(Advanced) Timeout (milliseconds) set for individual Kudu client rpcs. An operation may consist of several rpcs, so this is expected to be less than kudu_operation_timeout_ms. This must be a positive value or it will be ignored and Kudu's default of 10s will be used. There is no way to disable timeouts. |
0 |
0 |
false |
kudu_operation_timeout_ms (int32) |
Timeout (milliseconds) set for all Kudu operations. This must be a positive value, and there is no way to disable timeouts. |
180000 |
180000 |
false |
llama_callback_port (string) |
Removed |
__UNSET__ |
28000 |
true |
load_auth_to_local_rules (bool) |
If true, load auth_to_local configuration from hdfs' core-site.xml. When enabled, impalad reads the rules from the property hadoop.security.auth_to_local and applies them to translate the Kerberos principal to its corresponding local user name for authorization. |
false |
false |
false |
log_filename (string) |
Prefix of log filename - full path is <log_dir>/<log_filename>.[INFO|WARN|ERROR|FATAL] |
|
impalad |
false |
max_log_files (int32) |
Maximum number of log files to retain per severity level. The most recent log files are retained. If set to 0, all log files are retained. |
10 |
10 |
false |
max_minidumps (int32) |
Maximum number of minidump files to keep per daemon. Older files are removed first. Set to 0 to keep all minidump files. |
9 |
9 |
false |
mem_limit (string) |
Limit on process memory consumption. Includes the JVM's memory consumption only if --mem_limit_includes_jvm is true. Specified as number of bytes ('<int>[bB]?'), megabytes ('<float>[mM]'), gigabytes ('<float>[gG]'), or percentage of the physical memory ('<int>%'). Defaults to bytes if no unit is given. |
80% |
80% |
false |
mem_limit_includes_jvm (bool) |
If true, --mem_limit will include the JVM's max heap size and committed memory in the process memory limit. |
false |
false |
false |
min_buffer_size (int64) |
(Advanced) The minimum buffer size to use in the buffer pool |
8192 |
8192 |
false |
minidump_path (string) |
Directory to write minidump files to. This can be either an absolute path or a path relative to log_dir. Each daemon will create an additional sub-directory to prevent naming conflicts and to make it easier to identify a crashing daemon. Minidump files contain crash-related information in a compressed format and will be written when a daemon exits unexpectedly, for example on an unhandled exception or signal. It is also possible to create minidumps on demand without exiting the process by sending SIGUSR1. Set to empty to disable writing minidump files. |
minidumps |
/var/log/impala-minidumps/impalad |
false |
minidump_size_limit_hint_kb (int32) |
Size limit hint for minidump files in KB. If a minidump exceeds this value, then breakpad will reduce the stack memory it collects for each thread from 8KB to 2KB. However it will always include the full stack memory for the first 20 threads, including the thread that crashed. |
20480 |
20480 |
false |
principal (string) |
Kerberos principal. If set, both client and backend network connections will use Kerberos encryption and authentication. Kerberos will not be used for internal or external connections if this is not set. |
|
|
false |
pull_incremental_statistics (bool) |
When set, impalad coordinators pull incremental statistics from catalogd on-demand and catalogd does not broadcast incremental statistics via statestored to coordinators. If used, the flag must be set on both catalogd and all impalad coordinators. This feature should not be used when --use_local_catalog is true. |
true |
true |
false |
read_size (int32) |
(Advanced) The preferred I/O request size in bytes to issue to HDFS or the local filesystem. Increasing the read size will increase memory requirements. Decreasing the read size may decrease I/O throughput. |
8388608 |
8388608 |
false |
redaction_rules_file (string) |
Absolute path to sensitive data redaction rules. The rules will be applied to all log messages and query text shown in the Web UI and audit records. Query results will not be affected. Refer to the documentation for the rule file format. |
|
/var/run/cloudera-scm-agent/process/2676-impala-IMPALAD/redaction-rules.json |
false |
redirect_stdout_stderr (bool) |
If true, redirects stdout/stderr to INFO/ERROR log. |
true |
true |
false |
reserved_words_version (string) |
Reserved words compatibility version. Reserved words cannot be used as identifiers in SQL. This flag determines the impala version from which the reserved word list is taken. The value must be one of ["2.11.0", "3.0.0"]. |
3.0.0 |
3.0.0 |
false |
tcmalloc_max_total_thread_cache_bytes (int64) |
(Advanced) Bound on the total amount of bytes allocated to TCMalloc thread caches. If left at 0 (default), use the default value in TCMalloc library. |
0 |
1073741824 |
false |
local_library_dir (string) |
Scratch space for local fs operations. Currently used for copying UDF binaries locally from HDFS and also for initializing the timezone db |
/tmp |
/var/lib/impala/udfs |
false |
max_audit_event_log_files (int32) |
Maximum number of audit event log files to retain. The most recent audit event log files are retained. If set to 0, all audit event log files are retained. |
0 |
0 |
false |
memory_maintenance_sleep_time_ms (int32) |
Sleep time in milliseconds between memory maintenance iterations |
10000 |
10000 |
false |
pause_monitor_sleep_time_ms (int64) |
Sleep time in milliseconds for pause monitor thread. |
500 |
500 |
false |
pause_monitor_warn_threshold_ms (int64) |
If the pause monitor sleeps more than this time period, a warning is logged. If set to 0 or less, pause monitor is disabled. |
10000 |
10000 |
false |
data_stream_sender_buffer_size (int64) |
(Advanced) Max size in bytes which a row batch in a data stream sender's channel can accumulate before the row batch is sent over the wire. |
16384 |
16384 |
false |
data_source_batch_size (int32) |
Batch size for calls to GetNext() on external data sources. |
1024 |
1024 |
false |
exchg_node_buffer_size_bytes (int64) |
(Advanced) Maximum size of per-query receive-side buffer |
10485760 |
10485760 |
false |
enable_quadratic_probing (bool) |
Enable quadratic probing hash table |
true |
true |
false |
enable_orc_scanner (bool) |
If false, reading from ORC format tables is not supported |
true |
true |
false |
enabled_hdfs_text_scanner_plugins (string) |
(Advanced) whitelist of HDFS text scanner plugins that Impala will try to dynamically load. Must be a comma-separated list of upper-case compression codec names. Each plugin implements support for decompression and hands off the decompressed bytes to Impala's builtin text parser for further processing (e.g. parsing delimited text). |
LZO |
LZO |
false |
max_row_batches (int32) |
the maximum number of batches to queue in multithreaded HDFS scans |
0 |
0 |
false |
min_filter_reject_ratio (double) |
(Advanced) If the percentage of rows rejected by a runtime filter drops below this value, the filter is disabled. |
0.10000000000000001 |
0.10000000000000001 |
false |
kudu_max_row_batches (int32) |
The maximum size of the row batch queue, for Kudu scanners. |
0 |
0 |
false |
kudu_read_mode (string) |
(Advanced) Sets the Kudu scan ReadMode. Supported Kudu read modes are READ_LATEST and READ_AT_SNAPSHOT. Can be overridden with the query option of the same name. |
READ_LATEST |
READ_LATEST |
false |
kudu_scanner_keep_alive_period_sec (int32) |
The period at which Kudu Scanners should send keep-alive requests to the tablet server to ensure that scanners do not time out. |
15 |
15 |
false |
pick_only_leaders_for_tests (bool) |
Whether to pick only leader replicas, for tests purposes only. |
false |
false |
false |
kudu_error_buffer_size (int32) |
The size (bytes) of the Kudu client buffer for returning errors, with a min of 1KB.If the actual errors exceed this size the query will fail. |
10485760 |
10485760 |
false |
kudu_mutation_buffer_size (int32) |
The size (bytes) of the Kudu client buffer for mutations. |
10485760 |
10485760 |
false |
convert_legacy_hive_parquet_utc_timestamps (bool) |
When true, TIMESTAMPs read from files written by Parquet-MR (used by Hive) will be converted from UTC to local time. Writes are unaffected. |
false |
false |
false |
max_page_header_size (int32) |
max parquet page header size in bytes |
8388608 |
8388608 |
false |
max_queued_row_batch_bytes (int64) |
(Advanced) the maximum bytes of queued rows per multithreaded scan node. |
16777216 |
16777216 |
false |
runtime_filter_wait_time_ms (int32) |
(Advanced) the maximum time, in ms, that a scan node will wait for expected runtime filters to arrive. |
1000 |
1000 |
false |
hdfs_zone_alias_conf (string) |
HDFS/S3A/ADLS path to config file defining non-standard time-zone aliases. |
|
|
false |
hdfs_zone_info_zip (string) |
HDFS/S3A/ADLS path to a zip archive of the IANA time-zone database to use. |
|
|
false |
rpc_acceptor_listen_backlog (int32) |
Socket backlog parameter used when listening for RPC connections. This defines the maximum length to which the queue of pending TCP connections inbound to the RPC server may grow. If a connection request arrives when the queue is full, the client may receive an error. Higher values may help the server ride over bursts of new inbound connection requests. |
128 |
128 |
false |
rpc_encrypt_loopback_connections (bool) |
Whether to encrypt data transfer on RPC connections that stay within a single host. Encryption here is likely to offer no additional security benefit since only a local 'root' user could intercept the traffic, and wire encryption does not suitably protect against such an attacker. |
false |
false |
false |
rpc_negotiation_inject_delay_ms (int32) |
If enabled, injects the given number of milliseconds delay into the RPC negotiation process on the server side. |
0 |
0 |
false |
rpc_trace_negotiation (bool) |
If enabled, dump traces of all RPC negotiations to the log |
false |
false |
false |
rpc_callback_max_cycles (int64) |
The maximum number of cycles for which an RPC callback should be allowed to run without emitting a warning. (Advanced debugging option) |
100000000 |
100000000 |
false |
rpc_inject_cancellation_state (int32) |
If this flag is not -1, it is the state in which a cancellation request will be injected. Should use values in OutboundCall::State only |
-1 |
-1 |
false |
rpc_reopen_outbound_connections (bool) |
Open a new connection to the server for every RPC call. If not enabled, an already existing connection to a server is reused upon making another call to the same server. When this flag is enabled, an already existing _idle_ connection to the server is closed upon making another RPC call which would reuse the connection otherwise. Used by tests only. |
false |
false |
false |
remember_clients_ttl_ms (int64) |
Maximum amount of time, in milliseconds, the server "remembers" a client for the purpose of caching its responses. After this period without hearing from it, the client is no longer remembered and the memory occupied by its responses is reclaimed. Retries of requests older than 'remember_clients_ttl_ms' are treated as new ones. |
3600000 |
3600000 |
false |
remember_responses_ttl_ms (int64) |
Maximum amount of time, in milliseconds, the server "remembers" a response to a specific request for a client. After this period has elapsed, the response may have been garbage collected and the client might get a response indicating the request is STALE. |
600000 |
600000 |
false |
result_tracker_gc_interval_ms (int64) |
Interval at which the result tracker will look for entries to GC. |
1000 |
1000 |
false |
rpc_dump_all_traces (bool) |
If true, dump all RPC traces at INFO level |
false |
false |
false |
rpc_duration_too_long_ms (int32) |
Threshold (in milliseconds) above which a RPC is considered too long and its duration and method name are logged at INFO level. The time measured is between when a RPC is accepted and when its call handler completes. |
1000 |
120000 |
false |
rpc_inject_invalid_authn_token_ratio (double) |
If set higher than 0, AuthenticateByToken() randomly injects errors replying with FATAL_INVALID_AUTHENTICATION_TOKEN code. The flag's value corresponds to the probability of the fault injection event. Used for only for tests. |
0 |
0 |
false |
trusted_subnets (string) |
A trusted subnet whitelist. If set explicitly, all unauthenticated or unencrypted connections are prohibited except the ones from the specified address blocks. Otherwise, private network (127.0.0.0/8, etc.) and local subnets of all local network interfaces will be used. Set it to '0.0.0.0/0' to allow unauthenticated/unencrypted connections from all remote IP addresses. However, if network access is not otherwise restricted by a firewall, malicious users may be able to gain unauthorized access. |
127.0.0.0/8,10.0.0.0/8,172.16.0.0/12,192.168.0.0/16,169.254.0.0/16 |
127.0.0.0/8,10.0.0.0/8,172.16.0.0/12,192.168.0.0/16,169.254.0.0/16 |
false |
enable_exactly_once (bool) |
Whether to enable exactly once semantics. |
true |
true |
false |
use_system_auth_to_local (bool) |
When enabled, use the system krb5 library to map Kerberos principal names to local (short) usernames. If not enabled, the first component of the principal will be used as the short name. For example, 'kudu/foo.example.com@EXAMPLE' will map to 'kudu'. |
true |
true |
false |
ipki_server_key_size (int32) |
the number of bits for server cert's private key. The server cert is used for TLS connections to and from clients and other servers. |
2048 |
2048 |
false |
cache_force_single_shard (bool) |
Override all cache implementations to use just one shard |
false |
false |
false |
cache_memtracker_approximation_ratio (double) |
The MemTracker associated with a cache can accumulate error up to this ratio to improve performance. For tests. |
0.01 |
0.01 |
false |
trace_to_console (string) |
Trace pattern specifying which trace events should be dumped directly to the console |
|
|
false |
crash_on_eio (bool) |
Kill the process if an I/O operation results in EIO. If false, I/O resulting in EIOs will return the status IOError and leave error-handling up to the caller. |
false |
false |
false |
env_inject_eio (double) |
Fraction of the time that operations on certain files will fail with the posix code EIO. |
0 |
0 |
false |
env_inject_eio_globs (string) |
Comma-separated list of glob patterns specifying files on which I/O will fail. By default, all files may cause a failure. |
* |
* |
false |
env_inject_lock_failure_globs (string) |
Comma-separated list of glob patterns specifying files on which attempts to obtain a file lock will fail. By default, no files will fail. |
|
|
false |
env_inject_short_read_bytes (int32) |
The number of bytes less than the requested bytes to read |
0 |
0 |
false |
env_inject_short_write_bytes (int32) |
The number of bytes less than the requested bytes to write |
0 |
0 |
false |
env_use_fsync (bool) |
Use fsync(2) instead of fdatasync(2) for synchronizing dirty data to disk. |
false |
false |
false |
env_use_ioctl_hole_punch_on_xfs (bool) |
Use the XFS_IOC_UNRESVSP64 ioctl instead of fallocate(2) to punch holes on XFS filesystems. |
true |
true |
false |
never_fsync (bool) |
Never fsync() anything to disk. This is used by certain test cases to speed up runtime. This is very unsafe to use in production. |
false |
false |
false |
disk_reserved_bytes_free_for_testing (int64) |
For testing only! Set to number of bytes free on each filesystem. Set to -1 to disable this test-specific override |
-1 |
-1 |
false |
disk_reserved_override_prefix_1_bytes_free_for_testing (int64) |
For testing only! Set number of bytes free on the path prefix specified by --disk_reserved_override_prefix_1_path_for_testing. Set to -1 to disable. |
-1 |
-1 |
false |
disk_reserved_override_prefix_1_path_for_testing (string) |
For testing only! Specifies a prefix to override the visible 'bytes free' on. Use --disk_reserved_override_prefix_1_bytes_free_for_testing to set the number of bytes free for this path prefix. Set to empty string to disable. |
|
|
false |
disk_reserved_override_prefix_2_bytes_free_for_testing (int64) |
For testing only! Set number of bytes free on the path prefix specified by --disk_reserved_override_prefix_2_path_for_testing. Set to -1 to disable. |
-1 |
-1 |
false |
disk_reserved_override_prefix_2_path_for_testing (string) |
For testing only! Specifies a prefix to override the visible 'bytes free' on. Use --disk_reserved_override_prefix_2_bytes_free_for_testing to set the number of bytes free for this path prefix. Set to empty string to disable. |
|
|
false |
disable_core_dumps (bool) |
Disable core dumps when this process crashes. |
false |
false |
false |
dump_metrics_json (bool) |
Dump a JSON document describing all of the metrics which may be emitted by this binary. |
false |
false |
false |
redact (string) |
Comma-separated list that controls redaction context. Supported options are 'all','log', and 'none'. If 'all' is specified, sensitive data (sensitive configuration flags and row data) will be redacted from the web UI as well as glog and error messages. If 'log' is specified, sensitive data will only be redacted from glog and error messages. If 'none' is specified, no redaction will occur. |
all |
all |
false |
umask (string) |
The umask that will be used when creating files and directories. Permissions of top-level data directories will also be modified at start-up to conform to the given umask. Changing this value may enable unauthorized local users to read or modify data stored by Kudu. |
077 |
077 |
false |
unlock_experimental_flags (bool) |
Unlock flags marked as 'experimental'. These flags are not guaranteed to be maintained across releases of Kudu, and may enable features or behavior known to be unstable. Use at your own risk. |
false |
false |
false |
unlock_unsafe_flags (bool) |
Unlock flags marked as 'unsafe'. These flags are not guaranteed to be maintained across releases of Kudu, and enable features or behavior known to be unsafe. Use at your own risk. |
false |
false |
false |
hung_task_check_interval_ms (int32) |
Number of milliseconds in between checks for hung threads |
200 |
200 |
false |
inject_latency_on_kernel_stack_lookup_ms (int32) |
Number of milliseconds of latency to inject when reading a thread's kernel stack |
0 |
0 |
false |
log_async (bool) |
Enable asynchronous writing to log files. This improves latency and stability. |
true |
true |
false |
log_async_buffer_bytes_per_level (int32) |
The number of bytes of buffer space used by each log level. Only relevant when --log_async is enabled. |
2097152 |
2097152 |
false |
allocator_aligned_mode (bool) |
Use 16-byte alignment instead of 8-byte, unless explicitly specified otherwise - to boost SIMD |
false |
false |
false |
metrics_retirement_age_ms (int32) |
The minimum number of milliseconds a metric will be kept for after it is no longer active. (Advanced option) |
120000 |
120000 |
false |
fail_dns_resolution (bool) |
Whether to fail all dns resolution, for tests. |
false |
false |
false |
local_ip_for_outbound_sockets (string) |
IP to bind to when making outgoing socket connections. This must be an IP address of the form A.B.C.D, not a hostname. Advanced parameter, subject to change. |
|
|
false |
socket_inject_short_recvs (bool) |
Inject short recv() responses which return less data than requested |
false |
false |
false |
memory_limit_hard_bytes (int64) |
Maximum amount of memory this daemon should use, in bytes. A value of 0 autosizes based on the total system memory. A value of -1 disables all memory limiting. |
0 |
0 |
false |
memory_limit_soft_percentage (int32) |
Percentage of the hard memory limit that this daemon may consume before memory throttling of writes begins. The greater the excess, the higher the chance of throttling. In general, a lower soft limit leads to smoother write latencies but decreased throughput, and vice versa for a higher soft limit. |
80 |
80 |
false |
memory_limit_warn_threshold_percentage (int32) |
Percentage of the hard memory limit that this daemon may consume before WARNING level messages are periodically logged. |
98 |
98 |
false |
memory_pressure_percentage (int32) |
Percentage of the hard memory limit that this daemon may consume before flushing of in-memory data becomes prioritized. |
60 |
60 |
false |
thread_inject_start_latency_ms (int32) |
Number of ms to sleep when starting a new thread. (For tests). |
0 |
0 |
false |
accepted_cnxn_queue_depth (int32) |
(Advanced) The size of the post-accept, pre-setup connection queue in each thrift server set up to service Impala internal and external connections. |
10000 |
10000 |
false |
accepted_cnxn_setup_thread_pool_size (int32) |
(Advanced) The size of the thread pool that is used to process the post-accept, pre-setup connection queue in each thrift server set up to service Impala internal and external connections. |
2 |
2 |
false |
enable_ldap_auth (bool) |
If true, use LDAP authentication for client connections |
false |
false |
false |
internal_principals_whitelist (string) |
(Advanced) Comma-separated list of additional usernames authorized to access Impala's internal APIs. Defaults to 'hdfs' which is the system user that in certain deployments must access catalog server APIs. |
hdfs |
hdfs |
false |
ldap_allow_anonymous_binds (bool) |
(Advanced) If true, LDAP authentication with a blank password (an 'anonymous bind') is allowed by Impala. |
false |
false |
false |
ldap_baseDN (string) |
If set, Impala will try to bind to LDAP with a name of the form uid=<userid>,<ldap_baseDN> |
|
|
false |
ldap_bind_pattern (string) |
If set, Impala will try to bind to LDAP with a name of <ldap_bind_pattern>, but where the string #UID is replaced by the user ID. Use to control the bind name precisely; do not set --ldap_domain or --ldap_baseDN with this option |
|
|
false |
ldap_ca_certificate (string) |
The full path to the certificate file used to authenticate the LDAP server's certificate for SSL / TLS connections. |
|
|
false |
ldap_domain (string) |
If set, Impala will try to bind to LDAP with a name of the form <userid>@<ldap_domain> |
|
|
false |
ldap_manual_config (bool) |
Obsolete; Ignored |
false |
false |
false |
ldap_passwords_in_clear_ok (bool) |
If set, will allow LDAP passwords to be sent in the clear (without TLS/SSL) over the network. This option should not be used in production environments |
false |
false |
false |
ldap_tls (bool) |
If true, use the secure TLS protocol to connect to the LDAP server |
false |
false |
false |
ldap_uri (string) |
The URI of the LDAP server to authenticate users against |
|
|
false |
sasl_path (string) |
Colon separated list of paths to look for SASL security library plugins. |
|
|
false |
num_acceptor_threads (int32) |
Number of threads dedicated to accepting connection requests for RPC services |
2 |
2 |
false |
num_reactor_threads (int32) |
Number of threads dedicated to managing network IO for RPC services. If left at default value 0, it will be set to number of CPU cores. |
0 |
0 |
false |
rpc_negotiation_thread_count (int32) |
Maximum number of threads dedicated to handling RPC connection negotiations. |
64 |
64 |
false |
rpc_negotiation_timeout_ms (int32) |
Time in milliseconds of waiting for a negotiation to complete before timing out. |
300000 |
300000 |
false |
rpc_retry_interval_ms (int32) |
Time in millisecond of waiting before retrying an RPC when remote is busy |
5 |
5 |
false |
concurrent_scratch_ios_per_device (int32) |
Set this to influence the number of concurrent write I/Os issues to write data to scratch files. This is multiplied by the number of active scratch directories to obtain the target number of scratch write I/Os per query. |
2 |
2 |
false |
madvise_huge_pages (bool) |
(Advanced) If true, advise operating system to back large memory buffers with huge pages |
true |
true |
false |
mmap_buffers (bool) |
(Experimental) If true, allocate buffers directly from the operating system instead of with TCMalloc. |
false |
false |
false |
insert_inherit_permissions (bool) |
If true, new directories created by INSERTs will inherit the permissions of their parent directories |
false |
false |
false |
backend_client_connection_num_retries (int32) |
Retry backend connections. |
3 |
3 |
false |
backend_client_rpc_timeout_ms (int32) |
(Advanced) The underlying TSocket send/recv timeout in milliseconds for a backend client RPC. |
300000 |
300000 |
false |
catalog_client_connection_num_retries (int32) |
Retry catalog connections. |
3 |
3 |
false |
catalog_client_rpc_retry_interval_ms (int32) |
(Advanced) The time to wait before retrying when the catalog RPC client fails to connect to catalogd. |
10000 |
10000 |
false |
catalog_client_rpc_timeout_ms (int32) |
(Advanced) The underlying TSocket send/recv timeout in milliseconds for a catalog client RPC. |
0 |
0 |
false |
catalog_service_host (string) |
hostname where CatalogService is running |
localhost |
hadoop003 |
false |
coordinator_rpc_threads (int32) |
(Advanced) Number of threads available to start fragments on remote Impala daemons. |
12 |
12 |
false |
enable_webserver (bool) |
If true, debug webserver is enabled |
true |
true |
false |
num_hdfs_worker_threads (int32) |
(Advanced) The number of threads in the global HDFS operation pool |
16 |
16 |
false |
state_store_host (string) |
hostname where StatestoreService is running |
localhost |
hadoop001 |
false |
state_store_subscriber_port (int32) |
port where StatestoreSubscriberService should be exported |
23000 |
23000 |
false |
s3a_access_key_cmd (string) |
A Unix command whose output returns the access key to S3, i.e. "fs.s3a.access.key". |
|
|
false |
s3a_secret_key_cmd (string) |
A Unix command whose output returns the secret key to S3, i.e. "fs.s3a.secret.key". |
|
|
false |
data_cache_checksum (bool) |
(Advanced) Enable checksumming for the cached buffer. |
false |
false |
false |
data_cache_file_max_size_bytes (int64) |
(Advanced) The maximum size which a cache file can grow to before data stops being appended to it. |
1099511627776 |
1099511627776 |
false |
data_cache_max_opened_files (int32) |
(Advanced) The maximum number of allowed opened files. This must be at least the number of specified partitions. |
1000 |
1000 |
false |
data_cache_write_concurrency (int32) |
(Advanced) Number of concurrent threads allowed to insert into the cache per partition. |
1 |
1 |
false |
cache_remote_file_handles (bool) |
Enable the file handle cache for remote HDFS files. |
true |
true |
false |
data_cache (string) |
The configuration string for IO data cache. Default to be an empty string so it's disabled. The configuration string is expected to be a list of directories, separated by ',', followed by a ':' and a capacity quota per directory. For example /data/0,/data/1:1TB means the cache may use up to 2TB, with 1TB max in /data/0 and /data/1 respectively. Please note that each Impala daemon on a host must have a unique caching directory. |
|
|
false |
max_cached_file_handles (uint64) |
Maximum number of HDFS file handles that will be cached. Disabled if set to 0. |
20000 |
20000 |
false |
num_abfs_io_threads (int32) |
Number of ABFS I/O threads |
16 |
16 |
false |
num_adls_io_threads (int32) |
Number of ADLS I/O threads |
16 |
16 |
false |
num_disks (int32) |
Number of disks on data node. |
0 |
0 |
false |
num_file_handle_cache_partitions (uint64) |
Number of partitions used by the file handle cache. |
16 |
16 |
false |
num_io_threads_per_rotational_disk (int32) |
Number of I/O threads per rotational disk. Has priority over num_threads_per_disk. If neither is set, defaults to 1 thread(s) per rotational disk |
0 |
0 |
false |
num_io_threads_per_solid_state_disk (int32) |
Number of I/O threads per solid state disk. Has priority over num_threads_per_disk. If neither is set, defaults to 8 thread(s) per solid state disk |
0 |
0 |
false |
num_remote_hdfs_io_threads (int32) |
Number of remote HDFS I/O threads |
8 |
8 |
false |
num_s3_io_threads (int32) |
Number of S3 I/O threads |
16 |
16 |
false |
num_threads_per_disk (int32) |
Number of I/O threads per disk |
0 |
0 |
false |
unused_file_handle_timeout_sec (uint64) |
Maximum time, in seconds, that an unused HDFS file handle will remain in the file handle cache. Disabled if set to 0. |
21600 |
21600 |
false |
always_use_data_cache (bool) |
(Advanced) Always uses the IO data cache for all reads, regardless of whether the read is local or remote. By default, the IO data cache is only used if the data is expected to be remote. Used by tests. |
false |
false |
false |
use_hdfs_pread (bool) |
Enables using hdfsPread() instead of hdfsRead() when performing HDFS read operations. This is necessary to use HDFS hedged reads (assuming the HDFS client is configured to do so). |
false |
false |
false |
hdfs_operation_timeout_sec (uint64) |
Maximum time, in seconds, that an HDFS operation should wait before timing out and failing. |
300 |
300 |
false |
abfs_read_chunk_size (int64) |
The maximum read chunk size to use when reading from ABFS. |
131072 |
131072 |
false |
adls_read_chunk_size (int64) |
The maximum read chunk size to use when reading from ADLS. |
131072 |
131072 |
false |
datastream_sender_timeout_ms (int32) |
(Advanced) The time, in ms, that can elapse before a plan fragment will time-out trying to send the initial row batch. |
120000 |
120000 |
false |
datastream_service_deserialization_queue_size (int32) |
Number of deferred RPC requests that can be enqueued before being processed by a deserialization thread. |
10000 |
10000 |
false |
datastream_service_num_deserialization_threads (int32) |
Number of threads for deserializing RPC requests deferred due to the receiver not ready or the soft limit of the receiver is reached. |
16 |
16 |
false |
log_mem_usage_interval (int32) |
If non-zero, impalad will output memory usage every log_mem_usage_interval'th fragment completion. |
0 |
0 |
false |
status_report_interval_ms (int32) |
Interval between profile reports in milliseconds. If set to <= 0, periodic reporting is disabled. |
5000 |
5000 |
false |
status_report_max_retries (int32) |
Max number of times to retry sending the status report before cancelling |
3 |
3 |
false |
max_filter_error_rate (double) |
(Advanced) The maximum probability of false positives in a runtime filter before it is disabled. |
0.75 |
0.75 |
false |
num_threads_per_core (int32) |
Number of threads per core. |
3 |
3 |
false |
use_local_tz_for_unix_timestamp_conversions (bool) |
When true, TIMESTAMPs are interpreted in the local time zone when converting to and from Unix times. When false, TIMESTAMPs are interpreted in the UTC time zone. Set to true for Hive compatibility. |
false |
false |
false |
allow_multiple_scratch_dirs_per_device (bool) |
If false and --scratch_dirs contains multiple directories on the same device, then only the first writable directory is used |
true |
true |
false |
disk_spill_encryption (bool) |
Set this to encrypt and perform an integrity check on all data spilled to disk during a query |
true |
false |
false |
scratch_dirs (string) |
Writable scratch directories |
/tmp |
/data1/impala/impalad,/data2/impala/impalad |
false |
queue_wait_timeout_ms (int64) |
Maximum amount of time (in milliseconds) that a request will wait to be admitted before timing out. |
60000 |
60000 |
false |
default_pool_max_queued (int64) |
Maximum number of requests allowed to be queued before rejecting requests. A negative value or 0 indicates requests will always be rejected once the maximum number of concurrent requests are executing. Ignored if fair_scheduler_config_path and llama_site_path are set. |
200 |
200 |
false |
default_pool_max_requests (int64) |
Maximum number of concurrent outstanding requests allowed to run before queueing incoming requests. A negative value indicates no limit. 0 indicates no requests will be admitted. Ignored if fair_scheduler_config_path and llama_site_path are set. |
-1 |
-1 |
false |
default_pool_mem_limit (string) |
Maximum amount of memory that all outstanding requests in this pool may use before new requests to this pool are queued. Specified as number of bytes ('<int>[bB]?'), megabytes ('<float>[mM]'), gigabytes ('<float>[gG]'), or percentage of the physical memory ('<int>%'). Defaults to bytes if no unit is given. Ignored if fair_scheduler_config_path and llama_site_path are set. |
|
|
false |
disable_pool_max_requests (bool) |
Disables all per-pool limits on the maximum number of running requests. |
false |
false |
false |
disable_pool_mem_limits (bool) |
Disables all per-pool mem limits. |
false |
false |
false |
fair_scheduler_allocation_path (string) |
Path to the fair scheduler allocation file (fair-scheduler.xml). |
|
/var/run/cloudera-scm-agent/process/2676-impala-IMPALAD/impala-conf/fair-scheduler.xml |
false |
llama_site_path (string) |
Path to the Llama configuration file (llama-site.xml). If set, fair_scheduler_allocation_path must also be set. |
|
/var/run/cloudera-scm-agent/process/2676-impala-IMPALAD/impala-conf/llama-site.xml |
false |
require_username (bool) |
Requires that a user be provided in order to schedule requests. If enabled and a user is not provided, requests will be rejected, otherwise requests without a username will be submitted with the username 'default'. |
false |
false |
false |
control_service_num_svc_threads (int32) |
Number of threads for processing control service's RPCs. if left at default value 0, it will be set to number of CPU cores. Set it to a positive value to change from the default. |
0 |
0 |
false |
control_service_queue_mem_limit (string) |
(Advanced) Limit on RPC payloads consumption for ControlService. Specified as number of bytes ('<int>[bB]?'), megabytes ('<float>[mM]'), gigabytes ('<float>[gG]'), or percentage of the process memory limit ('<int>%'). Defaults to bytes if no unit is given. |
50MB |
50MB |
false |
datastream_service_num_svc_threads (int32) |
Number of threads for processing datastream services' RPCs. If left at default value 0, it will be set to number of CPU cores. Set it to a positive value to change from the default. |
0 |
0 |
false |
datastream_service_queue_mem_limit (string) |
(Advanced) Limit on RPC payloads consumption for DataStreamService. Specified as number of bytes ('<int>[bB]?'), megabytes ('<float>[mM]'), gigabytes ('<float>[gG]'), or percentage of the process memory limit ('<int>%'). Defaults to bytes if no unit is given. |
5% |
5% |
false |
authorization_policy_file (string) |
HDFS path to the authorization policy file. If set, authorization will be enabled and the authorization policy will be read from a file. |
|
|
false |
authorization_policy_provider_class (string) |
Advanced: The authorization policy provider class name. |
org.apache.sentry.provider.common.HadoopGroupResourceAuthorizationProvider |
org.apache.sentry.provider.common.HadoopGroupResourceAuthorizationProvider |
false |
authorized_proxy_group_config (string) |
Specifies the set of authorized proxy groups (users who can delegate to other users belonging to the specified groups during authorization) and whom they are allowed to delegate. Input is a semicolon-separated list of key=value pairs of authorized proxy users to the group(s) they can delegate to. These groups are specified as a list of groups separated by a delimiter (which defaults to comma and may be changed via --authorized_proxy_group_config_delimiter), or '*' to indicate all users. For example: hue=group1,group2;admin=* |
|
|
false |
authorized_proxy_group_config_delimiter (string) |
Specifies the delimiter used in authorized_proxy_group_config. |
, |
, |
false |
authorized_proxy_user_config (string) |
Specifies the set of authorized proxy users (users who can delegate to other users during authorization) and whom they are allowed to delegate. Input is a semicolon-separated list of key=value pairs of authorized proxy users to the user(s) they can delegate to. These users are specified as a list of short usernames separated by a delimiter (which defaults to comma and may be changed via --authorized_proxy_user_config_delimiter), or '*' to indicate all users. For example: hue=user1,user2;admin=* |
|
|
false |
authorized_proxy_user_config_delimiter (string) |
Specifies the delimiter used in authorized_proxy_user_config. |
, |
, |
false |
kudu_master_hosts (string) |
Specifies the default Kudu master(s). The given value should be a comma separated list of hostnames or IP addresses; ports are optional. |
|
|
false |
server_name (string) |
The name to use for securing this impalad server during authorization. Set to enable authorization. By default, the authorization policy will be loaded from the catalog server (via the statestore).To use a file based authorization policy, set --authorization_policy_file. |
|
|
false |
abort_on_failed_audit_event (bool) |
Shutdown Impala if there is a problem recording an audit event. |
true |
true |
false |
abort_on_failed_lineage_event (bool) |
Shutdown Impala if there is a problem recording a lineage record. |
true |
true |
false |
accepted_client_cnxn_timeout (int64) |
(Advanced) The amount of time in milliseconds an accepted connection will wait in the post-accept, pre-setup connection queue before it is timed out and the connection request is rejected. A value of 0 means there is no timeout. |
300000 |
300000 |
false |
audit_event_log_dir (string) |
The directory in which audit event log files are written. Setting this flag will enable audit event logging. |
|
|
false |
beeswax_port (int32) |
port on which Beeswax client requests are served.If 0 or less, the Beeswax server is not started. |
21000 |
21000 |
false |
cancellation_thread_pool_size (int32) |
(Advanced) Size of the thread-pool processing cancellations due to node failure |
5 |
5 |
false |
default_query_options (string) |
key=value pair of default query options for impalad, separated by ',' |
|
|
false |
fe_service_threads (int32) |
number of threads available to serve client requests |
64 |
64 |
false |
hs2_port (int32) |
port on which HiveServer2 client requests are served.If 0 or less, the HiveServer2 server is not started. |
21050 |
21050 |
false |
idle_query_timeout (int32) |
The time, in seconds, that a query may be idle for (i.e. no processing work is done and no updates are received from the client) before it is cancelled. If 0, idle queries are never expired. The query option QUERY_TIMEOUT_S overrides this setting, but, if set, --idle_query_timeout represents the maximum allowable timeout. |
0 |
0 |
false |
idle_session_timeout (int32) |
The time, in seconds, that a session may be idle for before it is closed (and all running queries cancelled) by Impala. If 0, idle sessions are never expired. It can be overridden by the query option 'idle_session_timeout' for specific sessions |
0 |
0 |
false |
is_coordinator (bool) |
If true, this Impala daemon can accept and coordinate queries from clients. If false, it will refuse client connections. |
true |
true |
false |
is_executor (bool) |
If true, this Impala daemon will execute query fragments. |
true |
true |
false |
lineage_event_log_dir (string) |
The directory in which lineage event log files are written. Setting this flag with enable lineage logging. |
|
/var/log/impalad/lineage |
false |
log_query_to_file (bool) |
if true, logs completed query profiles to file. |
true |
true |
false |
max_audit_event_log_file_size (int32) |
The maximum size (in queries) of the audit event log file before a new one is created (if event logging is enabled) |
5000 |
5000 |
false |
max_lineage_log_file_size (int32) |
The maximum size (in queries) of the lineage event log file before a new one is created (if lineage logging is enabled) |
5000 |
5000 |
false |
max_profile_log_file_size (int32) |
The maximum size (in queries) of the profile log file before a new one is created |
5000 |
5000 |
false |
max_profile_log_files (int32) |
Maximum number of profile log files to retain. The most recent log files are retained. If set to 0, all log files are retained. |
10 |
10 |
false |
max_result_cache_size (int64) |
Maximum number of query results a client may request to be cached on a per-query basis to support restarting fetches. This option guards against unreasonably large result caches requested by clients. Requests exceeding this maximum will be rejected. |
100000 |
100000 |
false |
profile_log_dir (string) |
The directory in which profile log files are written. If blank, defaults to <log_file_dir>/profiles |
|
/var/log/impalad/profiles/ |
false |
query_log_size (int32) |
Number of queries to retain in the query log. If -1, the query log has unbounded size. |
25 |
25 |
false |
shutdown_deadline_s (int64) |
Default time limit in seconds for the shut down process. If this duration elapses after the shut down process is started, the daemon shuts down regardless of any running queries. |
3600 |
3600 |
false |
shutdown_grace_period_s (int64) |
Shutdown startup grace period in seconds. When the shutdown process is started for this daemon, it will wait for at least the startup grace period before shutting down. This gives time for updated cluster membership information to propagate to all coordinators and for fragment instances that were scheduled based on old cluster membership to start executing (and therefore be reflected in the metrics used to detect quiescence). |
120 |
120 |
false |
ssl_cipher_list (string) |
The cipher suite preferences to use for TLS-secured Thrift RPC connections. Uses the OpenSSL cipher preference list format. See man (1) ciphers for more information. If empty, the default cipher list for your platform is used |
|
|
false |
ssl_client_ca_certificate (string) |
(Advanced) The full path to a certificate used by Thrift clients to check the validity of a server certificate. May either be a certificate for a third-party Certificate Authority, or a copy of the certificate the client expects to receive from the server. |
|
|
false |
ssl_minimum_version (string) |
The minimum SSL/TLS version that Thrift services should use for both client and server connections. Supported versions are TLSv1.0, TLSv1.1 and TLSv1.2 (as long as the system OpenSSL library supports them) |
tlsv1 |
tlsv1 |
false |
ssl_private_key (string) |
The full path to the private key used as a counterpart to the public key contained in --ssl_server_certificate. If --ssl_server_certificate is set, this option must be set as well. |
|
|
false |
ssl_private_key_password_cmd (string) |
A Unix command whose output returns the password used to decrypt the certificate private key file specified in --ssl_private_key. If the .PEM key file is not password-protected, this command will not be invoked. The output of the command will be truncated to 1024 bytes, and then all trailing whitespace will be trimmed before it is used to decrypt the private key |
|
|
false |
ssl_server_certificate (string) |
The full path to the SSL certificate file used to authenticate Impala to clients. If set, both Beeswax and HiveServer2 ports will only accept SSL connections |
|
|
false |
statestore_subscriber_cnxn_attempts (int32) |
The number of times to retry an RPC connection to the statestore. A setting of 0 means retry indefinitely |
10 |
10 |
false |
statestore_subscriber_cnxn_retry_interval_ms (int32) |
The interval, in ms, to wait between attempts to make an RPC connection to the statestore. |
3000 |
3000 |
false |
statestore_subscriber_timeout_seconds (int32) |
The amount of time (in seconds) that may elapse before the connection with the statestore is considered lost. |
30 |
30 |
false |
state_store_port (int32) |
port where StatestoreService is running |
24000 |
24000 |
false |
statestore_heartbeat_frequency_ms (int32) |
(Advanced) Frequency (in ms) with which the statestore sends heartbeat heartbeats to subscribers. |
1000 |
1000 |
false |
statestore_heartbeat_tcp_timeout_seconds (int32) |
(Advanced) The time after which a heartbeat RPC to a subscriber will timeout. This setting protects against badly hung machines that are not able to respond to the heartbeat RPC in short order |
3 |
3 |
false |
statestore_max_missed_heartbeats (int32) |
Maximum number of consecutive heartbeat messages an impalad can miss before being declared failed by the statestore. |
10 |
10 |
false |
statestore_max_subscribers (int32) |
Used to control the maximum size of the pending topic-update queue. There is at most one entry per subscriber. |
10000 |
10000 |
false |
statestore_num_heartbeat_threads (int32) |
(Advanced) Number of threads used to send heartbeats in parallel to all registered subscribers. |
10 |
10 |
false |
statestore_num_priority_update_threads (int32) |
(Advanced) Number of threads used to send prioritized topic updates in parallel to all registered subscribers. |
10 |
10 |
false |
statestore_num_update_threads (int32) |
(Advanced) Number of threads used to send topic updates in parallel to all registered subscribers. |
10 |
10 |
false |
statestore_priority_update_frequency_ms (int32) |
(Advanced) Frequency (in ms) with which the statestore sends prioritized topic updates to subscribers. |
100 |
100 |
false |
statestore_update_frequency_ms (int32) |
(Advanced) Frequency (in ms) with which the statestore sends topic updates to subscribers. |
2000 |
2000 |
false |
statestore_update_tcp_timeout_seconds (int32) |
(Advanced) The time after which an update RPC to a subscriber will timeout. This setting protects against badly hung machines that are not able to respond to the update RPC in short order. |
300 |
300 |
false |
force_lowercase_usernames (bool) |
If true, all principals and usernames are mapped to lowercase shortnames before being passed to any components (Sentry, admission control) for authorization |
false |
false |
false |
sasl_connect_tcp_timeout_ms (int32) |
(Advanced) The underlying TSocket send/recv timeout in milliseconds for the initial SASL handeshake. |
300000 |
300000 |
false |
num_cores (int32) |
(Advanced) If > 0, it sets the number of cores available to Impala. Setting it to 0 means Impala will use all available cores on the machine according to /proc/cpuinfo. |
0 |
0 |
false |
web_log_bytes (int64) |
The maximum number of bytes to display on the debug webserver's log page |
1048576 |
1048576 |
false |
jvm_deadlock_detector_interval_s (int64) |
(Advanced) Interval between JVM deadlock checks. If set to 0 or a negative value, deadlock checks are disabled. |
60 |
60 |
false |
non_impala_java_vlog (int32) |
(Advanced) The log level (equivalent to --v) for non-Impala Java classes (0: INFO, 1 and 2: DEBUG, 3: TRACE) |
0 |
0 |
false |
periodic_counter_update_period_ms (int32) |
Period to update rate counters and sampling counters in ms |
500 |
500 |
false |
enable_webserver_doc_root (bool) |
If true, webserver may serve static files from the webserver_doc_root |
true |
true |
false |
webserver_authentication_domain (string) |
Domain used for debug webserver authentication |
|
|
false |
webserver_certificate_file (string) |
The location of the debug webserver's SSL certificate file, in .pem format. If empty, webserver SSL support is not enabled |
|
|
false |
webserver_doc_root (string) |
Files under <webserver_doc_root>/www are accessible via the debug webserver. Defaults to $IMPALA_HOME, or if $IMPALA_HOME is not set, disables the document root |
/opt/cloudera/parcels/CDH-6.3.2-1.cdh6.3.2.p0.1605554/lib/impala |
/opt/cloudera/parcels/CDH-6.3.2-1.cdh6.3.2.p0.1605554/lib/impala |
false |
webserver_interface (string) |
Interface to start debug webserver on. If blank, webserver binds to 0.0.0.0 |
|
|
false |
webserver_password_file (string) |
(Optional) Location of .htpasswd file containing user names and hashed passwords for debug webserver authentication |
|
|
false |
webserver_port (int32) |
Port to start debug webserver on |
25000 |
25000 |
false |
webserver_private_key_file (string) |
The full path to the private key used as a counterpart to the public key contained in --webserver_certificate_file. If --webserver_certificate_file is set, this option must be set as well. |
|
|
false |
webserver_private_key_password_cmd (string) |
A Unix command whose output returns the password used to decrypt the Webserver's certificate private key file specified in --webserver_private_key_file. If the .PEM key file is not password-protected, this command will not be invoked. The output of the command will be truncated to 1024 bytes, and then all trailing whitespace will be trimmed before it is used to decrypt the private key |
|
|
false |
webserver_x_frame_options (string) |
webserver will add X-Frame-Options HTTP header with this value |
DENY |
DENY |
false |
flagfile (string) |
load flags from file |
|
/var/run/cloudera-scm-agent/process/2676-impala-IMPALAD/impala-conf/impalad_flags |
false |
fromenv (string) |
set flags from the environment [use 'export FLAGS_flag1=value'] |
|
|
false |
tryfromenv (string) |
set flags from the environment if present |
|
|
false |
undefok (string) |
comma-separated list of flag names that it is okay to specify on the command line even if the program does not define a flag with that name. IMPORTANT: flags in this list that have arguments MUST use the flag=value format |
|
|
false |
tab_completion_columns (int32) |
Number of columns to use in output for tab completion |
80 |
80 |
false |
tab_completion_word (string) |
If non-empty, HandleCommandLineCompletions() will hijack the process and attempt to do bash-style command line flag completion on this value. |
|
|
false |
help (bool) |
show help on all flags [tip: all flags can have two dashes] |
false |
false |
false |
helpfull (bool) |
show help on all flags -- same as -help |
false |
false |
false |
helpmatch (string) |
show help on modules whose name contains the specified substr |
|
|
false |
helpon (string) |
show help on the modules named by this flag value |
|
|
false |
helppackage (bool) |
show help on all modules in the main package |
false |
false |
false |
helpshort (bool) |
show help on only the main module for this program |
false |
false |
false |
helpxml (bool) |
produce an xml version of help |
false |
false |
false |
version (bool) |
show version and build info and exit |
false |
false |
false |
alsologtoemail (string) |
log messages go to these email addresses in addition to logfiles |
|
|
false |
alsologtostderr (bool) |
log messages go to stderr in addition to logfiles |
false |
false |
false |
colorlogtostderr (bool) |
color messages logged to stderr (if supported by terminal) |
false |
false |
false |
drop_log_memory (bool) |
Drop in-memory buffers of log contents. Logs can grow very quickly and they are rarely read before they need to be evicted from memory. Instead, drop them from memory as soon as they are flushed to disk. |
true |
true |
false |
log_backtrace_at (string) |
Emit a backtrace when logging at file:linenum. |
|
|
false |
log_dir (string) |
If specified, logfiles are written into this directory instead of the default logging directory. |
/var/log/impalad |
/var/log/impalad |
false |
log_link (string) |
Put additional links to the log files in this directory |
|
|
false |
log_prefix (bool) |
Prepend the log prefix to the start of each log line |
true |
true |
false |
logbuflevel (int32) |
Buffer log messages logged at this level or lower (-1 means don't buffer; 0 means buffer INFO only; ...) |
0 |
0 |
false |
logbufsecs (int32) |
Buffer log messages for at most this many seconds |
30 |
30 |
false |
logemaillevel (int32) |
Email log messages logged at this level or higher (0 means email all; 3 means email FATAL only; ...) |
999 |
999 |
false |
logmailer (string) |
Mailer used to send logging email |
/bin/mail |
/bin/mail |
false |
logtostderr (bool) |
log messages go to stderr instead of logfiles |
false |
false |
false |
max_log_size (int32) |
approx. maximum log file size (in MB). A value of 0 will be silently overridden to 1. |
200 |
200 |
false |
minloglevel (int32) |
Messages logged at a lower level than this don't actually get logged anywhere |
1 |
1 |
false |
stderrthreshold (int32) |
log messages at or above this level are copied to stderr in addition to logfiles. This flag obsoletes --alsologtostderr. |
2 |
4 |
false |
stop_logging_if_full_disk (bool) |
Stop attempting to log to disk if the disk is full. |
false |
false |
false |
symbolize_stacktrace (bool) |
Symbolize the stack trace in the tombstone |
true |
false |
false |
v (int32) |
Show all VLOG(m) messages for m <= this. Overridable by --vmodule. |
1 |
1 |
false |
vmodule (string) |
per-module verbose level. Argument is a comma-separated list of <module name>=<log level>. <module name> is a glob pattern, matched against the filename base (that is, name ignoring .cc/.h./-inl.h). <log level> overrides any value given by --v. |
|
|
false |