Friday, January 31, 2014

Custom indexes for an organization to help improve performance


For best performance, SOQL queries must be selective, particularly for queries inside of triggers. To avoid long execution times, non-selective SOQL queries may be terminated by the system. Developers will receive an error message when a non-selective query in a trigger executes against an object that contains more than 100,000 records. To avoid this error, ensure that the query is selective.

- Selective SOQL Query Criteria

        A query is selective when one of the query filters is on an indexed field and the query filter reduces the resulting number of rows below a system-defined threshold. The performance of the SOQL query improves when two or more filters used in the WHERE clause meet the mentioned conditions.
        The selectivity threshold is 10% of the records for the first million records and less than 5% of the records after the first million records, up to a maximum of 333,000 records. In some circumstances, for example with a query filter that is an indexed standard field, the threshold may be higher. Also, the selectivity threshold is subject to change.

- Custom Index Considerations for Selective SOQL Queries

        The following fields are indexed by default: primary keys (Id, Name and Owner fields), foreign keys (lookup or master-detail relationship fields), audit dates (such as LastModifiedDate), and custom fields marked as External ID or Unique.
        Salesforce.com Support can add custom indexes on request for customers.
        A custom index can't be created on these types of fields: multi-select picklists, currency fields in a multicurrency organization, long text fields, and binary fields (fields of type blob, file, or encrypted text.) Note that new data types, typically complex ones, may be added to Salesforce and fields of these types may not allow custom indexing.
        Typically, a custom index won't be used in these cases:
            The value(s) queried for exceeds the system-defined threshold mentioned above
            The filter operator is a negative operator such as NOT EQUAL TO (or !=), NOT CONTAINS, and NOT STARTS WITH
            The CONTAINS operator is used in the filter and the number of rows to be scanned exceeds 333,000. This is because the CONTAINS operator requires a full scan of the index. Note that this threshold is subject to change.
            When comparing with an empty value (Name != '')

        However, there are other complex scenarios in which custom indexes won't be used. Contact your salesforce.com representative if your scenario isn't covered by these cases or if you need further assistance with non-selective queries.

- Examples of Selective SOQL Queries
    To better understand whether a query on a large object is selective or not, let's analyze some queries. For these queries, we will assume there are more than 100,000 records (including soft-deleted records, that is, deleted records that are still in the Recycle Bin) for the Account sObject.

    Query 1:

    SELECT Id FROM Account WHERE Id IN ()

    The WHERE clause is on an indexed field (Id). If SELECT COUNT() FROM Account WHERE Id IN () returns fewer records than the selectivity threshold, the index on Id is used. This will typically be the case since the list of IDs only contains a small amount of records.
    Query 2:

    SELECT Id FROM Account WHERE Name != ''

    Since Account is a large object even though Name is indexed (primary key), this filter returns most of the records, making the query non-selective.
    Query 3:

    SELECT Id FROM Account WHERE Name != '' AND CustomField__c = 'ValueA'

    Here we have to see if each filter, when considered individually, is selective. As we saw in the previous example the first filter isn't selective. So let's focus on the second one. If the count of records returned by SELECT COUNT() FROM Account WHERE CustomField__c = 'ValueA' is lower than the selectivity threshold, and CustomField__c is indexed, the query is selective.
    Query 4:

    SELECT Id FROM Account WHERE FormulaField__c = 'ValueA'

Note:
  Formula field can also be indexed since Summer 12. The following rules have to be true in order to index a formula field:
    * The formula contains fields from a single object only (not relationship fields).
    * The formula field doesn't reference any non-deterministic functions (e.g. SYSDATE).
    * The formula field doesn't reference any non-supported fields for including in indexes.  This list isn't documented anywhere specifically (there are lots of special cases), but in Spring 12(176), createdById was non-supported.