Black-box Evaluation
====================

``veeksha`` performs black-box evaluation of both proprietary and open-source systems.

Check out the following resources to learn how to run ``veeksha`` with both proprietary and open-source systems:

.. toctree::
    :maxdepth: 2
    
    public_apis
    open_source_systems

Following figures show evaluations by ``veeksha``:

.. _token_rate_comparison_api:

.. figure:: ../_static/assets/token_rate_comparison_api.png
    :alt: toke_rate_comparison_api
    :align: center
    
    **Token Rate Comparison**

Above figure depicts throughput measured by ``veeksha`` for different systems based on three different metrics:

* TPOT
* TBT
* *fluid-token-generation-rate*: Here we find minimum TBT latency such that 99% of requests have *fluidity-index* at least 0.9. Inverse of TBT latency is *fluid-token-generation-rate*.

.. _tbt_cdf_api:

.. figure:: ../_static/assets/tbt_cdf_api_1.png
    :alt: tbt_cdf_api
    :align: center
    
    **TBT CDF**

Above figure depicts TBT CDF for different systems. It is difficult to interpret the difference in TBT across different systems.

.. _tbt_acceptance_rate_curve:

.. figure:: ../_static/assets/tbt_acceptance_rate_curve.png
    :alt: tbt_acceptance_rate_curve
    :align: center
    
    **TBT Acceptance Rate Curve**

Above figure clearly highlights the difference in TBT across different systems which was difficult to interpret in previous figure, :ref:`tbt_cdf_api`.