  1. Statistics - Apache Doris

    Doris supports automatic or manual statistics collection for tables from external data sources like Hive, Iceberg and Paimon. The accuracy of statistics directly determines the …

  2. Basic Statistics - RDD-based API - Spark 4.0.0 Documentation

    Statistics provides methods to run Pearson’s chi-squared tests. The following example demonstrates how to run and interpret hypothesis tests. Refer to the Statistics Python docs for …

  3. Apache Doris: Open source data warehouse for real time data …

    Apache Doris is an open-source database based on MPP architecture, with easier use and higher performance. As a modern data warehouse, Apache Doris empowers your OLAP query and …

  4. ANALYZE TABLE COMPUTE STATISTICS - Apache Drill

    You can run the ANALYZE TABLE COMPUTE STATISTICS statement at any time to compute statistics; however, you must enable the following option if you want Drill to use statistics …

  5. ANALYZE TABLE - Spark 3.5.3 Documentation

    The ANALYZE TABLE statement collects statistics about one specific table or all the tables in one specified database, that are to be used by the query optimizer to find a better query execution …

  6. Spark 3.5.3 ScalaDoc - org.apache.spark.mllib.stat.Statistics

    MultivariateStatisticalSummary object containing column-wise summary statistics.

  7. Statistics Collection | Apache Phoenix

    Parallelization in Phoenix is driven by the statistics related configuration parameters. Each chunk of data between guideposts will be run in parallel in a separate scan to improve query …

  8. MADlib: Statistics

    Jan 8, 2013 · A collection of probability and statistics modules.

  9. column_statistics - Apache Doris

    column_statistics. Overview: Column statistics. Database: __internal_schema. Table Information ...

  10. Geode Statistics List | Geode Docs

    Collected in the server, these statistics track event messages queued on the server to be sent to the client. The statistics are gathered for each client subscription queue and are incremental …