[go: nahoru, domu]

US20170139962A1 - Unified time series search across both log data and data from a real-time monitoring environment - Google Patents

Unified time series search across both log data and data from a real-time monitoring environment Download PDF

Info

Publication number
US20170139962A1
US20170139962A1 US15/420,985 US201715420985A US2017139962A1 US 20170139962 A1 US20170139962 A1 US 20170139962A1 US 201715420985 A US201715420985 A US 201715420985A US 2017139962 A1 US2017139962 A1 US 2017139962A1
Authority
US
United States
Prior art keywords
data
time
events
search
computer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US15/420,985
Inventor
Michael Joseph Baum
R. David Carasso
Robin Kumar Das
Rory Greene
Bradley Hall
Nicholas Christian Mealy
Brian Philip Murphy
Stephen Phillip Sorkin
Andre David Stechert
Erik M. Swan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Splunk Inc
Original Assignee
Splunk Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=39269257&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=US20170139962(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Splunk Inc filed Critical Splunk Inc
Priority to US15/420,985 priority Critical patent/US20170139962A1/en
Publication of US20170139962A1 publication Critical patent/US20170139962A1/en
Assigned to SPLUNK INC. reassignment SPLUNK INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: STECHERT, ANDRE, HALL, BRAD, BAUM, MICHAEL J., CARASSO, DAVID, DAS, ROBIN K., GREENE, RORY, MEALY, NICK, MURPHY, BRIAN, SORKIN, STEPHEN, SWAN, ERIK M.
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2477Temporal data queries
    • G06F17/30321
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2272Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2291User-Defined Types; Storage management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • G06F16/2308Concurrency control
    • G06F16/2315Optimistic concurrency control
    • G06F16/2322Optimistic concurrency control using timestamps
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24568Data stream processing; Continuous queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • G06F16/24575Query processing with adaptation to user needs using context
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • G06F16/24578Query processing with adaptation to user needs using ranking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/248Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • G06F17/30516
    • G06F17/30551
    • G06F17/30864
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F7/00Methods or arrangements for processing data by operating upon the order or content of the data handled
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units

Definitions

  • This invention relates generally to information organization, search, and retrieval and more particularly to time series data organization, search, and retrieval.
  • Time series data are sequences of time stamped records occurring in one or more usually continuous streams, representing some type of activity made up of discrete events. Examples include information processing logs, market transactions, and sensor data from real-time monitors (supply chains, military operation networks, or security systems). The ability to index, search, and present relevant search results is important to understanding and working with systems emitting large quantities of time series data.
  • Existing large scale search engines are designed to address the needs of less time sensitive types of data and are built on the assumption that only one state of the data needs to be stored in the index repository, for example, URLs in a Web search index, records in a customer database, or documents as part of a file system. Searches for information generally retrieve only a single copy of information based on keyword search terms: a collection of URLs from a Website indexed a few days ago, customer records from close of business yesterday, or a specific version of a document.
  • time series data from a typical information processing environment, shown in FIG. 1 .
  • Firewalls, routers, web servers, application servers and databases constantly generate streams of data in the form of events occurring perhaps hundreds or thousands of times per second.
  • historical data value and the patterns of data behavior over time are generally as important as current data values.
  • Existing search solutions generally have little notion of time-based indexing, searching or relevancy in the presentation of results and don't meet the needs of time series data.
  • an engine for searching time series data preferably would organize the index so that access to various time ranges, including less recent time ranges, is efficient. For example, unlike for many modem search engines, there may be significantly less benefit for a time series search engine to cache the top 1000 results for a particular keyword.
  • time series data is further complicated because the data can be collected from multiple, different sources asynchronously and out of order. Streams of data from one source may be seconds old and data from another source may be interleaved with other sources or may be days, weeks, or months older than other sources. Moreover, data source times may not be in sync with each other, requiring adjustments in time offsets post indexing. Furthermore, time stamps can have an almost unlimited number of formats making identification and interpretation difficult. Time stamps within the data can be hard to locate, with no standard for location, format, or temporal granularity (e.g., day, hour, minute, second, sub-second).
  • Searching time series data typically involves the ability to restrict search results efficiently to specified time windows and other time-based metadata such as frequency, distribution of inter-arrival time, and total number of occurrences or class of result. Keyword-based searching is generally secondary in importance but can be powerful when combined with time-based search mechanisms. Searching time series data requires a whole new way to express searches. Search engines today allow users to search by the most frequently occurring terms or keywords within the data and generally have little notion of time based searching. Given the large volume and repetitive characteristics of time series data, users often need to start by narrowing the set of potential search results using time-based search mechanisms and then, through examination of the results, choose one or more keywords to add to their search parameters.
  • Timeframes and time-based metadata like frequency, distribution, and likelihood of occurrence are especially important when searching time series data, but difficult to achieve with current search engine approaches. Try to find, for example, all stories referring to the “Space Shuttle” between the hours of 10 AM and 11 AM on May 10 , 2005 or the average number of “Space Shuttle” stories per hour the same day with a Web-based search engine of news sites. With a focus on when data happens, time-based search mechanisms and queries can be useful for searching time series data.
  • time-based search Some existing limited applications of time-based search exist in specific small-scale domains. For example, e-mail search is available today in many mainstream email programs and web-based email services. However, searches are limited to simple time functions like before, after, or time ranges; the data sets are generally small scale and highly structured from a single domain; and the real-time indexing mechanisms are append only, usually requiring the rebuilding of the entire index to interleave new data.
  • time series search results preferably would have the ability to be organized and presented by time-based patterns and behaviors. Users need the ability to see results at multiple levels of granularity (e.g., seconds, minutes, hours, days) and distribution (e.g., unexpected or least frequently occurring) and to view summary information reflecting patterns and behaviors across the result set.
  • granularity e.g., seconds, minutes, hours, days
  • distribution e.g., unexpected or least frequently occurring
  • time series search engine it would be desirable for the engine to index and allow for the searching of data in real-time. Any delay between the time data is collected and the time it is available to be searched is to be minimized. Enabling real-time operation against large, frequently changing data sets can be difficult with traditional large-scale search engines that optimize for small search response times at the expense of rapid data availability.
  • Web and document search engines typically start with a seed and crawl to collect data until a certain amount of time elapses or a collection size is reached. A snapshot of the collection is saved and an index is built, optimized, and stored. Frequently accessed indices are then loaded into a caching mechanism to optimize search response time.
  • This process can take hours or even days to complete depending on the size of the data set and density of the index. Contrast this with a real-time time series indexing mechanism designed to minimize the time between when data is collected and when the data is available to be searched.
  • the ability to insert, delete and reorganize indices, on the fly as data is collected, without rebuilding the index structure is essential to indexing time series data and providing real-time search results for this class of time series search engines.
  • log event analyzers such as Sawmill or Google's Sawzall can provide real-time analysis capabilities but are not search engines per se because they do not provide for ad hoc searches. Reports must be defined and built in advance of any analysis. Additionally, no general keyword-based or time-based search mechanisms are available.
  • Other streaming data research projects including the Stanford Streams project and products from companies like StreamBase Systems) can also produce analysis and alerting of streaming data but do not provide any persistence of data, indexing, time-based, or keyword-based searching.
  • TSSE time series search engine
  • one aspect of TSSEs is the use of time as a primary mechanism for indexing, searching, and/or presentation of search results.
  • a time series search language (TSSL) specific to time-based search mechanisms is used to express searches in human readable form and results are presented using relevancy algorithms specific to time series data. Search expression and results presentation are based on key concepts important to searching time series data including but not limited to time windows, frequency, distribution, patterns of occurrences, and related time series data points from multiple, disparate sources.
  • a time series search engine includes four parts: (1) a time stamp process; (2) an indexing process; (3) a search process; and (4) a results presentation process.
  • a computer-implemented method for time searching data includes the following steps.
  • Time series data streams are received.
  • One example of time series data streams includes server logs and other types of machine data (i.e., data generated by machines).
  • the time series data streams are time stamped to create time stamped events.
  • the time stamped events are time indexed to create time bucketed indices, which are used to fulfill search requests.
  • Time series search request are executed, at least in part, by searching the time bucketed indices.
  • time stamping the time series data streams includes aggregating the time series data streams into events and time stamping the events.
  • the events may be classified by domain and then time stamped according to their domain.
  • the time stamp is extracted from the event.
  • the time stamp is interpolated.
  • time bucketed indices are created by assigning the time stamped events to time buckets according to their time stamps.
  • Different bucket policies can be used.
  • the time buckets may all have the same time duration, or may have different time durations.
  • time buckets may be instantiated using a lazy allocation policy.
  • the time stamped events may also be segmented, and the segments used to determine time bucket indices.
  • Various forms of indexing including hot indexing, warm indexing and speculative indexing, may also be used.
  • time bucket indices facilitates the execution of time series searches.
  • a time series search request is divided into different sub-searches for the affected time buckets, with each sub-search executed across the corresponding time bucket index.
  • FIG. 1 (prior art) is a diagram of time series data environments.
  • FIG. 2 is a diagram of a time series search engine according to the invention.
  • FIG. 3 is a diagram of a time stamp process suitable for use with the time series search engine of FIG. 2 .
  • FIG. 4 is a diagram of an event aggregation process suitable for use with the time stamp process of FIG. 3 .
  • FIG. 5 is a diagram of an indexing process suitable for use with the time series search engine of FIG. 2 .
  • FIG. 6 is a diagram of a search process suitable for use with the time series search engine of FIG. 2 .
  • FIG. 7 is a diagram of a results presentation process suitable for use with the time series search engine of FIG. 2 .
  • FIG. 1 illustrates different examples of time series data environments with potentially large numbers of data sources and streams of time series data across multiple domains.
  • the first picture represents an information-processing environment with time series data from web servers, application servers, and databases in the form of server logs.
  • the second picture is a typical market-trading environment with transactions between multiple buyers and sellers and between two or more markets.
  • Time series data is generated in the form of transaction records representing the intention or trade or the final settlement of the trade as examples.
  • a real-time monitoring environment is depicted with multiple sensors producing time series data in the form of recorded measurements. All three of these environments are examples of potential applications for the TSSE.
  • time series data streams 205 at the TSSE 200 can be effected by having the TSSE gather them directly or by having a user-supplied script collect, preprocess, and deliver them to a default TSSE collection point.
  • This architecture preferably tolerates data arriving late and temporally out of order.
  • most sources of time series data are not designed for sophisticated processing of the data, so the TSSE typically will collect or be fed raw time series data that are close to their native form.
  • the TSSE can be situated in different locations so long as it has access to the time series data. For example, one copy of the TSSE can be run on a single central computer or multiple copies can be configured in a peer-to-peer set-up with each copy working on the same time series data streams or different time series data streams.
  • FIG. 2 depicts an example TSSE 200 with four major processes: time stamp process 210 , index process 220 , search process 230 and presentation process 240 .
  • the time stamp process 210 turns raw time series data 205 into time stamped events 215 to be fed to the indexing process 220 .
  • raw logs 205 from multiple web servers, application servers and databases might be processed by the time stamp process 210 to identify individual events 215 within the various log formats and properly extract time and other event data.
  • the event data 215 is used by the index process 220 to build time bucketed indices 225 of the events. These indices 225 are utilized by the search process 230 which takes searches 255 from users or systems, decomposes the searches, and then executes a search across a set of indices.
  • meta events 237 may represent averages, means, or counts of actual events or more sophisticated pattern based behavior. In this case a user might want to search to find all the events occurring with a frequency of three per minute.
  • the search process 230 hands results from the selected indices 235 to the presentation process 240 which merges result sets, ranks results, and feeds the results 275 to an API or user interface for presentation.
  • Process 210 shown in FIG. 2 of an exemplary implementation 200 of a TSSE is to acquire streaming time series data, identify individual events within the stream, and assign time stamps to each event.
  • An example time stamp process 210 block diagram is shown in FIG. 3 and includes several steps including event aggregation 310 , domain identification 320 , time extraction 330 , and time interpolation 340 .
  • Time series data streams 205 are received as input to the time stamp process 210 and then processed into individual time stamped events 215 .
  • Step 310 in the time stamp process 210 of FIG. 3 aggregates the streaming time series data 205 into individual events 315 .
  • web server time series data streams may have a single line per event and be easy to identify.
  • an application server time series data stream may contain single events with a large number of lines making identification of individual events within the stream difficult.
  • event aggregation 310 uses feature extraction (e.g., leading punctuation, significant words, white space, and breaking characters) and machine learning algorithms to determine where the event boundaries are.
  • FIG. 4 is a diagram of an event aggregation process suitable for use with the time stamp process of FIG. 3 .
  • an effective aggregation process 310 (such as shown in FIG. 4 ) preferably will learn about data formats and structure automatically. In one implementation, learning is separated into different domains based on the source of MD 205 . Domains can be general system types, such as log files, message bus traffic, and network management data, or specific types, such as output of a given application or technology—Sendmail logging data, Oracle database audit data, and J2EE messaging.
  • the domain for a given source of MD is identified 415 so that domain specific organization methods can be applied.
  • Domains are determined through a learning process.
  • the learning process uses collections of MD from well-known domains as input and creates a source signature 412 for each domain.
  • source signatures 412 are generated from representative samples of MD 205 by creating a hash table mapping punctuation characters to their frequency. While tokens and token values can change in MD collection, in this particular implementation, the signature 412 generated by the frequency of punctuation is quite stable, and reliable within a specific domain.
  • Other implementations could use functions of the punctuation and tokens, such as the frequencies of the first punctuation character on a line, or the first capitalized term on a line.
  • source signatures 412 can be large and hard to read, signatures can have a corresponding label in the form of a number or text that can be machine generated or human assigned.
  • the source signature 412 for an Apache web server log might be programmatically assigned the label “ 205 ”, or a user can assign the label “Apache Server Log”.
  • clustering is used to classify 415 collected MD 205 into domains according to their source signatures 412 .
  • each collection's signature is matched to the set of known source signatures 412 by performing a nearest-neighbor search. If the distance of the closest matching signature 412 is within a threshold, the closest matching signature 420 's domain is assumed to be the domain of the source. If no best match can be found, a new source signature 412 can be created from the sample signature and a new source domain created. Alternatively, a default source domain can be used.
  • the distance between two signatures is calculated by iterating over the union of attributes of the two signatures, with the total signature distance being the average of distances for each attribute.
  • the distance for attribute A is the square of (V 1 -V 2 )*IDF, where IDF is the log(N/
  • Some MD 205 sources are non-textual or binary and cannot be easily processed unless a known process is available to convert the binary MD into textual form.
  • a sample MD collection is analyzed.
  • Textual MD can also have embedded binary MD, such as a memory dump, and the classification preferably identifies it as such.
  • the textual/binary classification works as follows.
  • the sample is a set of lines of data, where a line is defined as the data between new lines (i.e., ‘ ⁇ n’), carriage returns (i.e., ‘ ⁇ r’), or their combination (i.e., ⁇ r ⁇ n′).
  • a count of Binary-looking lines is incremented. Otherwise, if the line's length is shorter than a length that one would expect most text lines to be below, such as 256 characters, a count of Text-looking lines is incremented. If the number of Text-looking lines is twice as numerous as the Binary-looking lines (other ratios can be used depending on the context), the source is classified as text. Otherwise, the source is classified as binary.
  • the corresponding aggregation rules are applied 425 to the MD collection.
  • Aggregation rules describe the manner in which MD 205 , from a particular domain, is organized 425 into event data 315 by identifying the boundaries of events within a collection of MD, for example, how to locate a discrete event by finding its beginning and ending.
  • the method of aggregation 425 learns, without prior knowledge, by grouping together multiple lines from a sample of MD 205 . Often MD 205 contains events 315 that are anywhere from one to hundreds of lines long that are somehow logically grouped together.
  • the MD collection may be known a priori, or may be classified, as single-line type (i.e., containing only single-line events) or multi-line type (i.e., possibly containing multi-line events) prior to performing aggregation.
  • single-line type i.e., containing only single-line events
  • multi-line type i.e., possibly containing multi-line events
  • a MD collection is classified as a multi-line type if 1) there is a large percentage of lines that start with spaces or are blank (e.g., if more than 5% of the lines start with spaces or are blank), or 2) there are too many varieties of punctuation characters in the first N punctuation characters. For example, if the set of the first three punctuation characters found on each line has more than five patterns (e.g., ‘:::’, ‘!:!’, ‘,,,’, ‘( )*’), the collection might be classified as multi-line.
  • aggregation methods 425 Another aspect of aggregation methods 425 is the ability to learn, and codify into rules, what constitutes a break between lines and therefore the boundary between events, by analyzing a sample of MD. For example, in one implementation, an aggregation method 425 compares every two-line pair looking for statistically similar structures (e.g., use of white space, indentation, and time-stamps) to quickly learn which two belong together and which two are independent. In one implementation, aggregation 425 works as follows. For each line, first check if the line starts with a time-stamp. If so, then break. Typically, lines starting with a time-stamp are the start of a new event.
  • statistically similar structures e.g., use of white space, indentation, and time-stamps
  • Line signatures are used in place of lines, where a line signature is a more stable version of a line, immune to simple numeric and textual changes.
  • signatures can be created by converting a line into a string that is the concatenation of leading white space, any punctuation on the line, and the first word on the line.
  • the line “10:29:03 Host 191.168.0.1 rebooting:normally” is converted to “:: . . . :Host.”
  • this current line signature can be concatenated with the previous line signature (i.e., signature 1 combined with signature 2 ) and used as a combined key into a table of break rules.
  • the break rule table maps the combined key to a break rule, which determines whether there should be a ‘break’, or not, between the two lines (i.e., whether they are part of different events or not).
  • Break rules can have confidence levels, and a more confident rule can override a less confident rule. Break rules can be created automatically by analyzing the co-occurrence data of the two lines and what percent of the time their signatures occur adjacently. If the two line signatures highly co-occur, a new rule would recommend no break between them.
  • a new rule would recommend a break between them. For example, if line signature A is followed by line signature B greater than 20% of the time A is seen, then a break rule might be created to recommend no break between them. Rules can also be created based on the raw number of line signatures that follow/proceed another line signature. For example, if a line signature is followed by say, ten different line signatures, create a rule that recommends a break between them. If there is no break rule in the break rule table, the default behavior is to break and assume the two lines are from different events. Processing proceeds by processing each two-line pair, updating line signature and co-occurrence statistics, and applying and learning corresponding break rules. At regular intervals, the break rule table is written out to the hard disk or permanent storage.
  • time stamp identification step 320 determines whether or not the time series event data contains tokens that indicate a match to one of a collection of known time stamp formats. If so, the event is considered to have a time stamp from a known domain and extraction 330 is performed. Otherwise, interpolation 340 is performed.
  • the event 315 is taken as input to a time stamp extraction step 330 where the time stamp from the raw event data is extracted and passed with the event to the indexing process 220 .
  • this timestamp extraction 330 occurs by iterating over potential time stamp format patterns from a dynamically ordered list in order to extract a time to be recorded as the number of seconds that have passed since the Unix epoch (0 seconds, 0 minutes, 0 hour, Jan. 1, 1970 coordinated universal time) not including leap seconds. Additionally, the implementation takes into account time zone information and normalizes the times to a common offset.
  • the ordering of this list is determined using a move-to-front algorithm, wherein whenever a match is found the matching pattern is moved to the beginning of the list.
  • the most frequently occurring patterns are checked earliest and most often, improving performance.
  • the move-to-front lists may be maintained either for all time series data sources together, on a per-source basis (to take advantage of the fact that the formats in a single source are likely to be similar), or in some other arrangement.
  • time stamp is linearly interpolated 340 from the time stamps of the immediately preceding and immediately following events 315 from the same time series data stream. If these events also contain no time stamps from a known domain, further earlier and/or later events can be used for the interpolation.
  • the time stamp extraction module 330 automatically stores the time stamp of every hundredth event (or some other configurable period) from each time series data stream in order to facilitate time stamp interpolation 340 .
  • time stamps are interpolated 340 based on the time associated with the entire time series data stream 205 including acquisition time, creation time or other contextual meta time data.
  • buckets with a uniform extent are used. For example, each time bucket can handle one hour's worth of data. Alternate policies might vary the bucket extents from one time period to another. For example, a bucketing policy may specify that the buckets for events from earlier than today are three hour buckets, but that the buckets for events occurring during the last 24 hours are hashed by the hour. In the information processing example, a bucket might cover the period Jan. 15, 2005 12:00:00 to Jan. 15, 2005 14:59:59. In order to improve efficiency further, buckets are instantiated using a lazy allocation policy (i.e., as late as possible) in primary memory (i.e., RAM).
  • a lazy allocation policy i.e., as late as possible
  • primary memory i.e., RAM
  • Bucket storage size is another element of the bucketing policy and varies along with the size of the temporal extent.
  • bucket policies typically enforce that buckets (a) do not overlap, and (b) cover all possible incoming time stamps.
  • Step 510 in indexing an event by time is to identify the appropriate bucket for the event based on the event's time stamp and the index's bucketing policy.
  • Each incoming event 215 is assigned 510 to the time bucket where the time stamp from the event matches the bucket's temporal criteria.
  • a database server event with the time stamp of Jan. 15, 2005 12:00:01 might be assigned to the above-mentioned bucket.
  • a segment (also known as a token) is a substring of the incoming event text and a segmentation 520 is the collection of segments implied by the segmentation algorithm on the incoming event data.
  • a segment sub string may overlap another substring, but if it does, it must be contained entirely within that substring. We allow this property to apply recursively to the containing substring, so that the segment hierarchy forms a tree on the incoming text.
  • segmentation 520 is performed by choosing two mutually exclusive sets of characters called minor breakers and major breakers. Whenever a breaking character, minor or major, is encountered during segmentation of the raw data, segments are emitted corresponding to any sequence of bytes that has at least one major breaker on one end of the sequence. For example, if, during segmentation, a minor breaking character is found, then a segment corresponding to the sequence of characters leading from the currently encountered minor breaker back to the last major breaker encountered is recorded. If a major breaker was encountered, then the sequence of characters leading back to either the last major breaker or the last minor breaker, whichever occurred most recently, determines the next segment to be recorded.
  • Segmentation 520 rules describe how to divide event data into segments 525 (also known as tokens).
  • a segmentation rule examines possible separators or punctuation within the event, for example, commas, spaces or semicolons.
  • An important aspect of segmentation is the ability to not only identify individual segments 525 , but also to identify overlapping segments. For example, the text of an email address, “bob.smith@corp.com”, can be broken into individual and overlapping segments; ⁇ bob.smith>, ⁇ @> and ⁇ corp.com> can be identified as individual segments, and ⁇ bob.smith> ⁇ @> ⁇ corp.com>> can also be identified as an overlapping segment.
  • segmentation 520 uses a two-tier system of major and minor breaks.
  • Major breaks are separators or punctuation that bound the outer most segment 525 . Examples include spaces, tabs, and new lines. Minor breaks are separators or punctuation that break larger segments into sub segments, for example periods, commas, and equal signs. In one implementation, more complex separators and punctuation combinations are used to handle complex segmentation tasks 520 , for example handling Java exceptions in an application server log file.
  • IP addresses could be broken down using white space as major breakers and periods as minor breakers.
  • segments for the raw text “192.168.1.1” could be:
  • certain segments may represent known entities that can be labeled and further understood algorithmically or by human added semantics.
  • “192.168.1.1” may be understood to be an IP address.
  • Named entity extraction can be algorithmically performed in a number of ways.
  • the segment values or segment form from the same segment across multiple events is compared to an entity dictionary of known values or known forms.
  • entity extraction techniques are used to identify semantic entities within the data.
  • search trees or regular expressions can be applied to extract and validate, for example, IP addresses or email addresses. The goal of extraction is to assist the segmentation process 520 and provide semantic value to the data.
  • incoming events have time stamps 215 , segments 525 , and a time bucket 515 associated with them.
  • the system preferably is careful in managing access to disk. For the indexes, this is accomplished by splitting index creation into two separate phases: hot indexing and warm indexing. Hot indexes are managed entirely in RAM, are optimized for the smallest possible insert time, are not searchable, and do not persist. “Warm” indexes are searchable and persistent, but immutable. When hot indexes need to be made searchable or need to be persistent, they are converted into warm indexes.
  • a hot index 555 contains a packed array of segments, a packed array of event addresses and their associated time stamps, and a postings list that associates segments with their time stamped event addresses.
  • the packed arrays can have hash tables associated with them to provide for quick removal of duplicates.
  • each segment of the event is tested for duplication using the segment array and its associated hash.
  • the event address is also tested for duplication, against the event address array and its associated hash. If either of the attributes is a duplicate, then the instance of that duplicate that has already been inserted into the packed array is used. Otherwise, the new segment or event address is copied into the appropriate table 550 and the associated hash table is updated.
  • a hot slice is considered to be “at capacity” when one of its packed arrays fills up or when one of its hash tables exceeds a usage threshold (e.g., if more than half of the hash table is in use).
  • a usage threshold e.g., if more than half of the hash table is in use.
  • speculative indexing Based on earlier indexing processes, new time buckets can be initialized using all or part of a representative, completed bucket as an exemplar. In other words, by keeping around copies of data that may reasonably be expected to occur in a time bucket, we can improve indexing performance by speculatively initializing parts of the hot index.
  • the speculative indexing is performed by copying the packed array of segments and its associated hash table from an earlier hot index. The hot slice is then populated as usual with the exception that the segment array is already populated and ready for duplicate testing. Because of the highly regular language and limited vocabulary of machines, the hit rate associated with this speculation can be very good.
  • the searching process allows the user to search on segments, segment prefixes, and segment suffixes.
  • the segments array can be sorted and then stored as a blocked front coded lexicon (hereafter called “the forward lexicon”).
  • the forward lexicon This data structure makes it possible to perform segment and segment prefix lookups efficiently while still achieving a reasonable amount of compression of the segment text.
  • the offset of the segment in the forward lexicon is used as an efficient way to look up metadata associated with the queried-for segment in other associated tables.
  • a blocked front coded lexicon can be created on the same collection of segments after they have been string-reversed (hereafter called “the reverse lexicon”). Also, a map is populated that converts the offset of a reversed segment in the reverse lexicon to the equivalent non-reversed segment's offset in the forward lexicon (hereafter called “the reverse-forward map”). When performing suffix lookups, the offset in the reverse lexicon is used as an offset into the reverse-forward map. The value stored at that position in the map is the appropriate offset to use for the other metadata arrays in the warm index.
  • the warm index provides a list of event offsets for each segment indexed, preferably in an efficient manner. In one implementation, this can be done by maintaining an array of compressed postings lists and an associated array of offsets to the beginning of each of those compressed postings lists.
  • the postings lists are maintained in segment offset order, so when a lookup is performed, the segment ID can be used to find the appropriate entry of the postings lists offsets array.
  • the values in the postings lists entries are the offsets that should be used to look up events in the packed array of event addresses.
  • statistical metadata can be provided for each indexed segment (e.g., the first and last time of occurrence of the segment, the mean inter-arrival time, and the standard deviation of the inter-arrival time).
  • a single time bucket will be filled and committed to disk 560 several times. This will result in multiple, independently searchable indices in secondary storage for a single time span.
  • the indexing process therefore, preferably contains an expiration process 580 that monitors the database for time buckets to be deleted based on user-provided preferences.
  • these preferences might include a trailing time window (“events older than 3 months need not be returned in search results”), a time range (“events earlier than January 1 of this year need not be returned in search results”), a maximum number of events (“no more than 1 million events need be returned in search results”), or a maximum total size for the index (“return as many useful search results as possible while consuming no more than 100 GB of Disk”).
  • a process periodically wakes up and tests the collection of warm slices for any slices that meet the expiration criterion.
  • a warm index file and its associated raw event data and segmentation is moved out of the active index.
  • the index file need not necessarily be deleted. In one implementation, the index file could be streamed to less expensive offline storage.
  • An example TSSE search process is shown in FIG. 6 and includes several methods for parsing 610 a search phrase, issuing multiple sub-searches 625 in order to satisfy the incoming parent search, using sub-search results 635 to prune searches, and merging 640 sub-search results into a final set of search results for presentation to the user.
  • incoming search phrases 255 are parsed 610 according to a time series search language (TSSL) in order to generate annotated parse trees 615 .
  • TSSL time series search language
  • An exemplary TSSL language syntax includes a series of modifiers or commands taking the format name::value. Some modifiers may have default values and some can only be used once, while some can appear several times in the same search with different values. Examples include the following:
  • Modifiers can be combined with keywords, wildcard characters, literal strings, quoted phrases and Boolean operators, such as AND, OR, NOT. Parentheses can be used to nest search and sub-search phrases together.
  • An example search phrase might be “sourcetype::mysql*sock*NOT (started OR (host::foo OR host::BAR)) maxresults::10 (eventtype::baddb OR eventtype::?8512-3) daysago::30”.
  • a custom parser 610 handles the Boolean operators “NOT” and “OR” and defaults to “AND”. This implementation also handles using parentheses to disambiguate the language when there are several operators. Otherwise, it associates left-to-right.
  • the implementation also supports special search operators that are indicated using a domain specifier followed by a demarcation element. For example, searching for “source::1234”, might indicate that the searcher (human or system) wants to restrict results to events that were received from a particular source ID.
  • Incoming search phrases may also trigger ad hoc computation 612 based on a map of special keywords.
  • a special search string might be used to indicate that a search is to be stored and reissued on a periodic basis or to request a list of sources.
  • the search string would be stored in a table on disk along with a schedule specifying the schedule on which the search should be reissued.
  • additional actions may be triggered. For example, an email alert might be sent, an RSS feed might be updated, or a user-supplied script might be executed.
  • Another example of a search that triggers ad hoc computation 612 is one that is indicated to be saved for later use, but not to be reissued on a periodic basis.
  • the search execution engine 620 will use the annotated syntax tree 615 to issue sub-searches 625 to the time bucketed indices 565 .
  • Each sub-search 625 is targeted at an individual time bucket 565 .
  • Time buckets are queried in the order that is most advantageous to pruning given the sort order for the results. For example, if search results are sorted in reverse chronological order, then the sub-search for the most recent time bucket will be issued first. This allows the search execution engine 620 to examine the results 635 of the sub-search before proceeding with additional (expensive) sub-searches 625 . For example, if a particular sub-search returns enough results 635 , then it is not necessary to proceed with additional sub-searches 625 .
  • This merging process performs a merge sort on the results from each of the buckets to keep them in the order required for the presentation process.
  • the final process in an exemplary implementation of our example TSSE is the preparation of search results for presentation 240 , as shown in FIG. 7 .
  • this example TSSE can present results organized by time, event relationships, and keyword relevance ranking.
  • a primary benefit of a TSSE is the ability to index, search and present time series data chronologically. Results can be presented by aggregating and summarizing search results based on discrete time ranges or based on statistical calculations.
  • the example TSSL can specify to see results for only a particular time frame and/or to see results presented by seconds, minutes, hours, days, weeks or months.
  • the search window can be limited to a timeframe and the results can be constructed for optimal viewing based on the density of the expected result set returned from a search.
  • the search “192.168.169.100 hoursago::24 page:: seconds”, will return time series events including the keyword “192.168.169.100” that occurred within the last 24 hours and will summarize the display results by seconds.
  • summarization can include both aggregated display lines summarizing the events for the summary window and/or paging the results by the summary window.
  • each page of the search results presentation may include one second in time. Examples include but are not limited to:
  • an example TSSE preferably is able to present additional aggregation and summarization of results by metadata characteristics 720 , such as, data source, data source type, event type, or originating host machine.
  • metadata characteristics 720 such as, data source, data source type, event type, or originating host machine.
  • results can be not only organized by time, but also refined by metadata aggregation and summarization.
  • the search “192.168.169.100 page::source” will present all the results with “192.168.169.100” and put each data source containing results on a separate page. Examples include but are not limited to:
  • an example TSSE user interaction model can include the ability to move from small increments of time (seconds or minutes) or metadata parameters (different classes of IP addresses) using a zoom control 730 .
  • This zoom control can be combined with other metadata search parameters to enable the rapid movement through large amounts of data. Examples include but are not limited to:
  • the presentation density can be controlled 740 to return and/or display only the raw data without any metadata in a simple ASCII text format.
  • the same results can be returned and or displayed with full metadata as rich XML.
  • the TSSE can be implemented in many different ways.
  • each box shown in the various figures is implemented in software as a separate process. All of the processes can run on a single machine or they can be divided up to run on separate logical or physical machines.
  • the invention is implemented in computer hardware, firmware, software, and/or combinations thereof.
  • Apparatus of the invention can be implemented in a computer program product tangibly embodied in a machine-readable storage device for execution by a programmable processor; and method steps of the invention can be performed by a programmable processor executing a program of instructions to perform functions of the invention by operating on input data and generating output.
  • the invention can be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device.
  • Each computer program can be implemented in a high-level procedural or object-oriented programming language, or in assembly or machine language if desired; and in any case, the language can be a compiled or interpreted language.
  • Suitable processors include, by way of example, both general and special purpose microprocessors. Generally, a processor will receive instructions and data from a read-only memory and/or a random access memory.
  • a computer will include one or more mass storage devices for storing data files; such devices include magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and optical disks.
  • Storage devices suitable for tangibly embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM disks. Any of the foregoing can be supplemented by, or incorporated in, ASICs (application-specific integrated circuits) and other forms of hardware.
  • ASICs application-specific integrated circuits

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Fuzzy Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Methods and apparatus consistent with the invention provide the ability to organize, index, search, and present time series data based on searches. Time series data are sequences of time stamped records occurring in one or more usually continuous streams, representing some type of activity. In one embodiment, time series data is organized into discrete events with normalized time stamps and the events are indexed by time and keyword. A search is received and relevant event information is retrieved based in whole or in part on the time indexing mechanism, keyword indexing mechanism, or statistical indices calculated at the time of the search.

Description

    RELATED APPLICATIONS
  • The present application is a continuation U.S. patent application Ser. No. 14/611,170, filed on Jan. 30, 2015; which is a continuation of U.S. patent application Ser. No. 13/353,135, filed on Jan. 18, 2012, issued as U.S. Pat. No. 9,002,854 on Apr. 7, 2015; which is a continuation of U.S. patent application Ser. No. 11/868,370, filed Oct. 5, 2007, issued as U.S. Pat. No. 8,112,425 on Feb. 7, 2012; which claims priority under 35 U.S.C. §119(e) to U.S. Provisional Patent Application No. 60/828,283, filed Oct. 5, 2006. The subject matter of all of the foregoing is incorporated herein by reference in its entirety.
  • BACKGROUND
  • Field
  • This invention relates generally to information organization, search, and retrieval and more particularly to time series data organization, search, and retrieval.
  • Description of the Related Art
  • Time series data are sequences of time stamped records occurring in one or more usually continuous streams, representing some type of activity made up of discrete events. Examples include information processing logs, market transactions, and sensor data from real-time monitors (supply chains, military operation networks, or security systems). The ability to index, search, and present relevant search results is important to understanding and working with systems emitting large quantities of time series data.
  • Existing large scale search engines (e.g., Google and Yahoo web search) are designed to address the needs of less time sensitive types of data and are built on the assumption that only one state of the data needs to be stored in the index repository, for example, URLs in a Web search index, records in a customer database, or documents as part of a file system. Searches for information generally retrieve only a single copy of information based on keyword search terms: a collection of URLs from a Website indexed a few days ago, customer records from close of business yesterday, or a specific version of a document.
  • In contrast, consider an example of time series data from a typical information processing environment, shown in FIG. 1. Firewalls, routers, web servers, application servers and databases constantly generate streams of data in the form of events occurring perhaps hundreds or thousands of times per second. Here, historical data value and the patterns of data behavior over time are generally as important as current data values. Existing search solutions generally have little notion of time-based indexing, searching or relevancy in the presentation of results and don't meet the needs of time series data.
  • Compared to full text search engines, which organize their indices so that retrieving documents with the highest relevance scores is most efficient, an engine for searching time series data preferably would organize the index so that access to various time ranges, including less recent time ranges, is efficient. For example, unlike for many modem search engines, there may be significantly less benefit for a time series search engine to cache the top 1000 results for a particular keyword.
  • On the other hand, given the repetitive nature of time series data, opportunities for efficiency of index construction and search optimization are available. However, indexing time series data is further complicated because the data can be collected from multiple, different sources asynchronously and out of order. Streams of data from one source may be seconds old and data from another source may be interleaved with other sources or may be days, weeks, or months older than other sources. Moreover, data source times may not be in sync with each other, requiring adjustments in time offsets post indexing. Furthermore, time stamps can have an almost unlimited number of formats making identification and interpretation difficult. Time stamps within the data can be hard to locate, with no standard for location, format, or temporal granularity (e.g., day, hour, minute, second, sub-second).
  • Searching time series data typically involves the ability to restrict search results efficiently to specified time windows and other time-based metadata such as frequency, distribution of inter-arrival time, and total number of occurrences or class of result. Keyword-based searching is generally secondary in importance but can be powerful when combined with time-based search mechanisms. Searching time series data requires a whole new way to express searches. Search engines today allow users to search by the most frequently occurring terms or keywords within the data and generally have little notion of time based searching. Given the large volume and repetitive characteristics of time series data, users often need to start by narrowing the set of potential search results using time-based search mechanisms and then, through examination of the results, choose one or more keywords to add to their search parameters. Timeframes and time-based metadata like frequency, distribution, and likelihood of occurrence are especially important when searching time series data, but difficult to achieve with current search engine approaches. Try to find, for example, all stories referring to the “Space Shuttle” between the hours of 10 AM and 11 AM on May 10, 2005 or the average number of “Space Shuttle” stories per hour the same day with a Web-based search engine of news sites. With a focus on when data happens, time-based search mechanisms and queries can be useful for searching time series data.
  • Some existing limited applications of time-based search exist in specific small-scale domains. For example, e-mail search is available today in many mainstream email programs and web-based email services. However, searches are limited to simple time functions like before, after, or time ranges; the data sets are generally small scale and highly structured from a single domain; and the real-time indexing mechanisms are append only, usually requiring the rebuilding of the entire index to interleave new data.
  • Also unique to the cyclicality of time series data is the challenge of presenting useful results. Traditional search engines typically present results ranked by popularity and commonality. Contrary to this, for time series data, the ability to focus on data patterns and infrequently occurring, or unusual results may be important. To be useful, time series search results preferably would have the ability to be organized and presented by time-based patterns and behaviors. Users need the ability to see results at multiple levels of granularity (e.g., seconds, minutes, hours, days) and distribution (e.g., unexpected or least frequently occurring) and to view summary information reflecting patterns and behaviors across the result set. Existing search engines, on the other hand, generally return text results sorted by key word density, usage statistics, or links to or from documents and Web pages in attempts to display the most popular results first.
  • In one class of time series search engine, it would be desirable for the engine to index and allow for the searching of data in real-time. Any delay between the time data is collected and the time it is available to be searched is to be minimized. Enabling real-time operation against large, frequently changing data sets can be difficult with traditional large-scale search engines that optimize for small search response times at the expense of rapid data availability. For example, Web and document search engines typically start with a seed and crawl to collect data until a certain amount of time elapses or a collection size is reached. A snapshot of the collection is saved and an index is built, optimized, and stored. Frequently accessed indices are then loaded into a caching mechanism to optimize search response time. This process can take hours or even days to complete depending on the size of the data set and density of the index. Contrast this with a real-time time series indexing mechanism designed to minimize the time between when data is collected and when the data is available to be searched. The ability to insert, delete and reorganize indices, on the fly as data is collected, without rebuilding the index structure is essential to indexing time series data and providing real-time search results for this class of time series search engines.
  • Other software that is focused on time series, e.g., log event analyzers such as Sawmill or Google's Sawzall can provide real-time analysis capabilities but are not search engines per se because they do not provide for ad hoc searches. Reports must be defined and built in advance of any analysis. Additionally, no general keyword-based or time-based search mechanisms are available. Other streaming data research projects (including the Stanford Streams project and products from companies like StreamBase Systems) can also produce analysis and alerting of streaming data but do not provide any persistence of data, indexing, time-based, or keyword-based searching.
  • There exists, therefore, a need to develop other techniques for indexing, searching and presenting search results from time series data.
  • SUMMARY
  • Methods and apparatus consistent with the invention address these and other needs by allowing for the indexing, searching, and retrieval of time series data using a time series search engine (TSSE). In one implementation, one aspect of TSSEs is the use of time as a primary mechanism for indexing, searching, and/or presentation of search results. A time series search language (TSSL) specific to time-based search mechanisms is used to express searches in human readable form and results are presented using relevancy algorithms specific to time series data. Search expression and results presentation are based on key concepts important to searching time series data including but not limited to time windows, frequency, distribution, patterns of occurrences, and related time series data points from multiple, disparate sources.
  • In one aspect of the invention, multiple sources of time series data are organized and indexed for searching and results are presented upon user or machine initiated searches. In another aspect, a time series search engine (TSSE) includes four parts: (1) a time stamp process; (2) an indexing process; (3) a search process; and (4) a results presentation process.
  • In one aspect of the invention, a computer-implemented method for time searching data includes the following steps. Time series data streams are received. One example of time series data streams includes server logs and other types of machine data (i.e., data generated by machines). The time series data streams are time stamped to create time stamped events. The time stamped events are time indexed to create time bucketed indices, which are used to fulfill search requests. Time series search request are executed, at least in part, by searching the time bucketed indices.
  • In certain implementations, time stamping the time series data streams includes aggregating the time series data streams into events and time stamping the events. For example, the events may be classified by domain and then time stamped according to their domain. In one approach, for events that are classified in a domain with a known time stamp format, the time stamp is extracted from the event. However, for events that are not classified in a domain with a known time stamp format, the time stamp is interpolated.
  • In another aspect of the invention, time bucketed indices are created by assigning the time stamped events to time buckets according to their time stamps. Different bucket policies can be used. For example, the time buckets may all have the same time duration, or may have different time durations. In addition, time buckets may be instantiated using a lazy allocation policy. The time stamped events may also be segmented, and the segments used to determine time bucket indices. Various forms of indexing, including hot indexing, warm indexing and speculative indexing, may also be used.
  • The creation of time bucket indices facilitates the execution of time series searches. In one approach, a time series search request is divided into different sub-searches for the affected time buckets, with each sub-search executed across the corresponding time bucket index.
  • Other aspects of the invention include software, computer systems and other devices corresponding to the methods described above, and applications for all of the foregoing.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The invention has other advantages and features which will be more readily apparent from the following detailed description of the invention and the appended claims, when taken in conjunction with the accompanying drawings, in which:
  • FIG. 1 (prior art) is a diagram of time series data environments.
  • FIG. 2 is a diagram of a time series search engine according to the invention.
  • FIG. 3 is a diagram of a time stamp process suitable for use with the time series search engine of FIG. 2.
  • FIG. 4 is a diagram of an event aggregation process suitable for use with the time stamp process of FIG. 3.
  • FIG. 5 is a diagram of an indexing process suitable for use with the time series search engine of FIG. 2.
  • FIG. 6 is a diagram of a search process suitable for use with the time series search engine of FIG. 2.
  • FIG. 7 is a diagram of a results presentation process suitable for use with the time series search engine of FIG. 2.
  • The figures depict embodiments of the present invention for purposes of illustration only. One skilled in the art will readily recognize from the following discussion that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles of the invention described herein.
  • DETAILED DESCRIPTION
  • FIG. 1 illustrates different examples of time series data environments with potentially large numbers of data sources and streams of time series data across multiple domains. In this figure, the first picture represents an information-processing environment with time series data from web servers, application servers, and databases in the form of server logs. The second picture is a typical market-trading environment with transactions between multiple buyers and sellers and between two or more markets. Time series data is generated in the form of transaction records representing the intention or trade or the final settlement of the trade as examples. In the third picture, a real-time monitoring environment is depicted with multiple sensors producing time series data in the form of recorded measurements. All three of these environments are examples of potential applications for the TSSE.
  • Aspects of the invention will be described with respect to the first picture in FIG. 1, the information-processing environment, but the invention can also be used with other time series data environments and applications including the other environments shown in FIG. 1.
  • FIG. 2 illustrates one approach 200 to architecting a TSSE. Time series data streams 205 arrive synchronously or asynchronously from multiple sources, multiple searches 255 are expressed by users and/or other systems, and results sets 275 are presented through a variety of mechanisms including, for example, application programming interfaces and web-based user interfaces.
  • The arrival of time series data streams 205 at the TSSE 200 can be effected by having the TSSE gather them directly or by having a user-supplied script collect, preprocess, and deliver them to a default TSSE collection point. This architecture preferably tolerates data arriving late and temporally out of order. Currently, most sources of time series data are not designed for sophisticated processing of the data, so the TSSE typically will collect or be fed raw time series data that are close to their native form. The TSSE can be situated in different locations so long as it has access to the time series data. For example, one copy of the TSSE can be run on a single central computer or multiple copies can be configured in a peer-to-peer set-up with each copy working on the same time series data streams or different time series data streams.
  • FIG. 2 depicts an example TSSE 200 with four major processes: time stamp process 210, index process 220, search process 230 and presentation process 240. The time stamp process 210 turns raw time series data 205 into time stamped events 215 to be fed to the indexing process 220. Following our information processing example, raw logs 205 from multiple web servers, application servers and databases might be processed by the time stamp process 210 to identify individual events 215 within the various log formats and properly extract time and other event data. The event data 215 is used by the index process 220 to build time bucketed indices 225 of the events. These indices 225 are utilized by the search process 230 which takes searches 255 from users or systems, decomposes the searches, and then executes a search across a set of indices.
  • For example, a user might want to locate all the events from a particular web server and a particular application server occurring within the last hour and which contain a specific IP address. In addition, the search process 230 may choose to initiate the creation of meta events 237 at search time to handle time-based and statistical summary indices useful in searching through repetitive, temporal data. For example, meta events 237 may represent averages, means, or counts of actual events or more sophisticated pattern based behavior. In this case a user might want to search to find all the events occurring with a frequency of three per minute.
  • Upon completion, the search process 230 hands results from the selected indices 235 to the presentation process 240 which merges result sets, ranks results, and feeds the results 275 to an API or user interface for presentation.
  • Time Stamp Process
  • Process 210 shown in FIG. 2 of an exemplary implementation 200 of a TSSE is to acquire streaming time series data, identify individual events within the stream, and assign time stamps to each event. An example time stamp process 210 block diagram is shown in FIG. 3 and includes several steps including event aggregation 310, domain identification 320, time extraction 330, and time interpolation 340. Time series data streams 205 are received as input to the time stamp process 210 and then processed into individual time stamped events 215.
  • Event Aggregation
  • Step 310 in the time stamp process 210 of FIG. 3 aggregates the streaming time series data 205 into individual events 315. In our information-processing example, web server time series data streams may have a single line per event and be easy to identify. However, an application server time series data stream may contain single events with a large number of lines making identification of individual events within the stream difficult.
  • In one implementation, event aggregation 310 uses feature extraction (e.g., leading punctuation, significant words, white space, and breaking characters) and machine learning algorithms to determine where the event boundaries are. FIG. 4 is a diagram of an event aggregation process suitable for use with the time stamp process of FIG. 3.
  • Source Identification—Classification into Domains
  • Given the repetitive, yet dynamic, nature of the time series data 205 in our information processing example (which data will be referred to as machine data 205 or MD 205), an effective aggregation process 310 (such as shown in FIG. 4) preferably will learn about data formats and structure automatically. In one implementation, learning is separated into different domains based on the source of MD 205. Domains can be general system types, such as log files, message bus traffic, and network management data, or specific types, such as output of a given application or technology—Sendmail logging data, Oracle database audit data, and J2EE messaging.
  • In this example event aggregation process 310, the domain for a given source of MD is identified 415 so that domain specific organization methods can be applied. Domains are determined through a learning process. The learning process uses collections of MD from well-known domains as input and creates a source signature 412 for each domain. In one implementation, source signatures 412 are generated from representative samples of MD 205 by creating a hash table mapping punctuation characters to their frequency. While tokens and token values can change in MD collection, in this particular implementation, the signature 412 generated by the frequency of punctuation is quite stable, and reliable within a specific domain. Other implementations could use functions of the punctuation and tokens, such as the frequencies of the first punctuation character on a line, or the first capitalized term on a line. Given that source signatures 412 can be large and hard to read, signatures can have a corresponding label in the form of a number or text that can be machine generated or human assigned. For example, the source signature 412 for an Apache web server log might be programmatically assigned the label “205”, or a user can assign the label “Apache Server Log”.
  • In one embodiment, clustering is used to classify 415 collected MD 205 into domains according to their source signatures 412. As collections of MD 205 are encountered, each collection's signature is matched to the set of known source signatures 412 by performing a nearest-neighbor search. If the distance of the closest matching signature 412 is within a threshold, the closest matching signature 420's domain is assumed to be the domain of the source. If no best match can be found, a new source signature 412 can be created from the sample signature and a new source domain created. Alternatively, a default source domain can be used. In one implementation, the distance between two signatures is calculated by iterating over the union of attributes of the two signatures, with the total signature distance being the average of distances for each attribute. For each attribute A, the value of A on Signature1 and Signature2, V1 and V2, are compared and a distance is calculated. The distance for attribute A is the square of (V1-V2)*IDF, where IDF is the log(N/|A|), where N is the number of signatures, and |A| is the number of signatures with attribute A.
  • Source Identification—Classification as Text/Binary
  • Some MD 205 sources are non-textual or binary and cannot be easily processed unless a known process is available to convert the binary MD into textual form. To classify a source as textual or binary, a sample MD collection is analyzed. Textual MD can also have embedded binary MD, such as a memory dump, and the classification preferably identifies it as such. In one implementation, the textual/binary classification works as follows. The sample is a set of lines of data, where a line is defined as the data between new lines (i.e., ‘\n’), carriage returns (i.e., ‘\r’), or their combination (i.e., \r\n′). For each line, if the line's length is larger than some large threshold, such as 2 k characters, or if the line contains a character with an ASCII value of zero (0), a count of Binary-looking lines is incremented. Otherwise, if the line's length is shorter than a length that one would expect most text lines to be below, such as 256 characters, a count of Text-looking lines is incremented. If the number of Text-looking lines is twice as numerous as the Binary-looking lines (other ratios can be used depending on the context), the source is classified as text. Otherwise, the source is classified as binary.
  • Aggregation of Machine Data into Raw Events
  • When the source signature 420 for a collection of MD has been identified 415, the corresponding aggregation rules are applied 425 to the MD collection. Aggregation rules describe the manner in which MD 205, from a particular domain, is organized 425 into event data 315 by identifying the boundaries of events within a collection of MD, for example, how to locate a discrete event by finding its beginning and ending. In one implementation, the method of aggregation 425 learns, without prior knowledge, by grouping together multiple lines from a sample of MD 205. Often MD 205 contains events 315 that are anywhere from one to hundreds of lines long that are somehow logically grouped together.
  • The MD collection may be known a priori, or may be classified, as single-line type (i.e., containing only single-line events) or multi-line type (i.e., possibly containing multi-line events) prior to performing aggregation. For those MD collections that are classified as single line type, aggregation 425 is simple—single-line type MD collections are broken on each line as a separate event. Multi-line type MD collections are processed 425 for aggregation. In one implementation, a MD collection is classified as a multi-line type if 1) there is a large percentage of lines that start with spaces or are blank (e.g., if more than 5% of the lines start with spaces or are blank), or 2) there are too many varieties of punctuation characters in the first N punctuation characters. For example, if the set of the first three punctuation characters found on each line has more than five patterns (e.g., ‘:::’, ‘!:!’, ‘,,,’, ‘( )*’), the collection might be classified as multi-line.
  • Another aspect of aggregation methods 425 is the ability to learn, and codify into rules, what constitutes a break between lines and therefore the boundary between events, by analyzing a sample of MD. For example, in one implementation, an aggregation method 425 compares every two-line pair looking for statistically similar structures (e.g., use of white space, indentation, and time-stamps) to quickly learn which two belong together and which two are independent. In one implementation, aggregation 425 works as follows. For each line, first check if the line starts with a time-stamp. If so, then break. Typically, lines starting with a time-stamp are the start of a new event. For lines that do not start with a time-stamp, combine the current line with the prior line to see how often the pair of lines occurs, one before the other, as a percentage of total pairs in the MD sample. Line signatures are used in place of lines, where a line signature is a more stable version of a line, immune to simple numeric and textual changes. In this implementation, signatures can be created by converting a line into a string that is the concatenation of leading white space, any punctuation on the line, and the first word on the line. The line “10:29:03 Host 191.168.0.1 rebooting:normally” is converted to “:: . . . :Host.”
  • Now this current line signature can be concatenated with the previous line signature (i.e., signature1 combined with signature2) and used as a combined key into a table of break rules. The break rule table maps the combined key to a break rule, which determines whether there should be a ‘break’, or not, between the two lines (i.e., whether they are part of different events or not). Break rules can have confidence levels, and a more confident rule can override a less confident rule. Break rules can be created automatically by analyzing the co-occurrence data of the two lines and what percent of the time their signatures occur adjacently. If the two line signatures highly co-occur, a new rule would recommend no break between them. Alternatively, if they rarely co-occur, a new rule would recommend a break between them. For example, if line signature A is followed by line signature B greater than 20% of the time A is seen, then a break rule might be created to recommend no break between them. Rules can also be created based on the raw number of line signatures that follow/proceed another line signature. For example, if a line signature is followed by say, ten different line signatures, create a rule that recommends a break between them. If there is no break rule in the break rule table, the default behavior is to break and assume the two lines are from different events. Processing proceeds by processing each two-line pair, updating line signature and co-occurrence statistics, and applying and learning corresponding break rules. At regular intervals, the break rule table is written out to the hard disk or permanent storage.
  • Time Stamp Identification
  • Once the incoming time series stream 205 has been aggregated 310 into individual events 315, the events and their event data are input into a time stamp identification step 320 which determines whether or not the time series event data contains tokens that indicate a match to one of a collection of known time stamp formats. If so, the event is considered to have a time stamp from a known domain and extraction 330 is performed. Otherwise, interpolation 340 is performed.
  • Time Stamp Extraction
  • If a known domain has been identified for an event, the event 315 is taken as input to a time stamp extraction step 330 where the time stamp from the raw event data is extracted and passed with the event to the indexing process 220. In an exemplary implementation, this timestamp extraction 330 occurs by iterating over potential time stamp format patterns from a dynamically ordered list in order to extract a time to be recorded as the number of seconds that have passed since the Unix epoch (0 seconds, 0 minutes, 0 hour, Jan. 1, 1970 coordinated universal time) not including leap seconds. Additionally, the implementation takes into account time zone information and normalizes the times to a common offset. To increase performance, the ordering of this list is determined using a move-to-front algorithm, wherein whenever a match is found the matching pattern is moved to the beginning of the list. In such an implementation, the most frequently occurring patterns are checked earliest and most often, improving performance. The move-to-front lists may be maintained either for all time series data sources together, on a per-source basis (to take advantage of the fact that the formats in a single source are likely to be similar), or in some other arrangement.
  • Time Stamp Interpolation
  • In the case where the event did not contain a time stamp from a known domain, then a timestamp is assigned to the event based on its context. In one implementation, the time stamp is linearly interpolated 340 from the time stamps of the immediately preceding and immediately following events 315 from the same time series data stream. If these events also contain no time stamps from a known domain, further earlier and/or later events can be used for the interpolation. The time stamp extraction module 330 automatically stores the time stamp of every hundredth event (or some other configurable period) from each time series data stream in order to facilitate time stamp interpolation 340. In another implementation, time stamps are interpolated 340 based on the time associated with the entire time series data stream 205 including acquisition time, creation time or other contextual meta time data.
  • Indexing Process
  • Returning to FIG. 2, in the indexing process 220, indexes are created based on incoming event data 215. The indexing process 220 organizes and optimizes the set of indices in an online fashion as they are extended with more events. An example TSSE indexing process 220 is shown in FIG. 5 and includes, in one implementation, several steps including bucketing 510, segmenting 520, archival 530, allocation 540, insertion 550, committing to secondary storage 560, merging buckets in secondary storage 570, and expiring buckets in secondary storage 580.
  • Time Bucketing
  • Events indexed by the TSSE are often queried, updated, and expired using time-based operators. By hashing the components of the index over a set of buckets organized by time, the efficiency and performance of these operators can be significantly improved. The final efficiency of the bucketing will, of course, depend on the hardware configuration, the order in which the events arrive, and how they are queried, so there is not a single perfect bucketing policy.
  • In one implementation, buckets with a uniform extent are used. For example, each time bucket can handle one hour's worth of data. Alternate policies might vary the bucket extents from one time period to another. For example, a bucketing policy may specify that the buckets for events from earlier than today are three hour buckets, but that the buckets for events occurring during the last 24 hours are hashed by the hour. In the information processing example, a bucket might cover the period Jan. 15, 2005 12:00:00 to Jan. 15, 2005 14:59:59. In order to improve efficiency further, buckets are instantiated using a lazy allocation policy (i.e., as late as possible) in primary memory (i.e., RAM). In-memory buckets have a maximum capacity and, when they reach their limit, they will be committed to disk and replaced by a new bucket. Bucket storage size is another element of the bucketing policy and varies along with the size of the temporal extent. Finally, bucket policies typically enforce that buckets (a) do not overlap, and (b) cover all possible incoming time stamps.
  • Step 510 in indexing an event by time is to identify the appropriate bucket for the event based on the event's time stamp and the index's bucketing policy. Each incoming event 215 is assigned 510 to the time bucket where the time stamp from the event matches the bucket's temporal criteria. In one implementation, we use half-open intervals, defined by a start time and an end time where the start time is an inclusive boundary and the end time is an exclusive boundary. We do this so that events occurring on bucket boundaries are uniquely assigned to a bucket. Following our example in the information processing environment, a database server event with the time stamp of Jan. 15, 2005 12:00:01 might be assigned to the above-mentioned bucket.
  • Segmentation
  • Once an appropriate bucket has been identified 510 for an event, the raw event data is segmented 520. A segment (also known as a token) is a substring of the incoming event text and a segmentation 520 is the collection of segments implied by the segmentation algorithm on the incoming event data. A segment sub string may overlap another substring, but if it does, it must be contained entirely within that substring. We allow this property to apply recursively to the containing substring, so that the segment hierarchy forms a tree on the incoming text.
  • In one implementation, segmentation 520 is performed by choosing two mutually exclusive sets of characters called minor breakers and major breakers. Whenever a breaking character, minor or major, is encountered during segmentation of the raw data, segments are emitted corresponding to any sequence of bytes that has at least one major breaker on one end of the sequence. For example, if, during segmentation, a minor breaking character is found, then a segment corresponding to the sequence of characters leading from the currently encountered minor breaker back to the last major breaker encountered is recorded. If a major breaker was encountered, then the sequence of characters leading back to either the last major breaker or the last minor breaker, whichever occurred most recently, determines the next segment to be recorded.
  • Segmentation 520 rules describe how to divide event data into segments 525 (also known as tokens). In one implementation a segmentation rule examines possible separators or punctuation within the event, for example, commas, spaces or semicolons. An important aspect of segmentation is the ability to not only identify individual segments 525, but also to identify overlapping segments. For example, the text of an email address, “bob.smith@corp.com”, can be broken into individual and overlapping segments; <bob.smith>, <@> and <corp.com> can be identified as individual segments, and <<bob.smith><@>< corp.com>> can also be identified as an overlapping segment. As described above, in one implementation, segmentation 520 uses a two-tier system of major and minor breaks. Major breaks are separators or punctuation that bound the outer most segment 525. Examples include spaces, tabs, and new lines. Minor breaks are separators or punctuation that break larger segments into sub segments, for example periods, commas, and equal signs. In one implementation, more complex separators and punctuation combinations are used to handle complex segmentation tasks 520, for example handling Java exceptions in an application server log file.
  • An example of segmentation in our information-processing example, IP addresses could be broken down using white space as major breakers and periods as minor breakers. Thus, the segments for the raw text “192.168.1.1” could be:
  • “192”
  • “192.168”
  • “192.168.1”
  • “192.168.1.1”
  • In another implementation, certain segments may represent known entities that can be labeled and further understood algorithmically or by human added semantics. For example, in the above representation, “192.168.1.1” may be understood to be an IP address. Named entity extraction can be algorithmically performed in a number of ways. In one implementation, the segment values or segment form from the same segment across multiple events is compared to an entity dictionary of known values or known forms.
  • In another implementation, entity extraction techniques are used to identify semantic entities within the data. In one implementation, search trees or regular expressions can be applied to extract and validate, for example, IP addresses or email addresses. The goal of extraction is to assist the segmentation process 520 and provide semantic value to the data. Archiving and Indexing Events
  • At this point in the process, incoming events have time stamps 215, segments 525, and a time bucket 515 associated with them. To create the persistent data structures that will be used later to perform lookups in the search process, we store the raw data of the event with its segmentation, create indices that map segments and time stamps to offsets in the event data store, and compute and store metadata related to the indices.
  • Because the TSSE tolerates, in near real time, both the arrival of new events and new searches, the system preferably is careful in managing access to disk. For the indexes, this is accomplished by splitting index creation into two separate phases: hot indexing and warm indexing. Hot indexes are managed entirely in RAM, are optimized for the smallest possible insert time, are not searchable, and do not persist. “Warm” indexes are searchable and persistent, but immutable. When hot indexes need to be made searchable or need to be persistent, they are converted into warm indexes.
  • In the implementation shown in FIG. 5, a hot index 555 contains a packed array of segments, a packed array of event addresses and their associated time stamps, and a postings list that associates segments with their time stamped event addresses. For performance reasons, the packed arrays can have hash tables associated with them to provide for quick removal of duplicates. When incoming events are being indexed, each segment of the event is tested for duplication using the segment array and its associated hash. The event address is also tested for duplication, against the event address array and its associated hash. If either of the attributes is a duplicate, then the instance of that duplicate that has already been inserted into the packed array is used. Otherwise, the new segment or event address is copied into the appropriate table 550 and the associated hash table is updated. As events are inserted into the hot index, the space associated with each of the packed arrays gets used. A hot slice is considered to be “at capacity” when one of its packed arrays fills up or when one of its hash tables exceeds a usage threshold (e.g., if more than half of the hash table is in use). Once a hot index reaches capacity 540, it cannot accept more segments for indexing. Instead it is converted to a warm index, committed to disk 560, and replaced with a new empty hot index.
  • Another feature of this particular system is speculative indexing. Based on earlier indexing processes, new time buckets can be initialized using all or part of a representative, completed bucket as an exemplar. In other words, by keeping around copies of data that may reasonably be expected to occur in a time bucket, we can improve indexing performance by speculatively initializing parts of the hot index. In one embodiment, the speculative indexing is performed by copying the packed array of segments and its associated hash table from an earlier hot index. The hot slice is then populated as usual with the exception that the segment array is already populated and ready for duplicate testing. Because of the highly regular language and limited vocabulary of machines, the hit rate associated with this speculation can be very good.
  • The searching process (as described in the next section) allows the user to search on segments, segment prefixes, and segment suffixes. To accommodate these search types, in one implementation, the segments array can be sorted and then stored as a blocked front coded lexicon (hereafter called “the forward lexicon”). This data structure makes it possible to perform segment and segment prefix lookups efficiently while still achieving a reasonable amount of compression of the segment text. When a search is being performed on a particular segment, the offset of the segment in the forward lexicon is used as an efficient way to look up metadata associated with the queried-for segment in other associated tables.
  • To handle suffix lookups, a blocked front coded lexicon can be created on the same collection of segments after they have been string-reversed (hereafter called “the reverse lexicon”). Also, a map is populated that converts the offset of a reversed segment in the reverse lexicon to the equivalent non-reversed segment's offset in the forward lexicon (hereafter called “the reverse-forward map”). When performing suffix lookups, the offset in the reverse lexicon is used as an offset into the reverse-forward map. The value stored at that position in the map is the appropriate offset to use for the other metadata arrays in the warm index.
  • The warm index provides a list of event offsets for each segment indexed, preferably in an efficient manner. In one implementation, this can be done by maintaining an array of compressed postings lists and an associated array of offsets to the beginning of each of those compressed postings lists. The postings lists are maintained in segment offset order, so when a lookup is performed, the segment ID can be used to find the appropriate entry of the postings lists offsets array. The values in the postings lists entries are the offsets that should be used to look up events in the packed array of event addresses.
  • Finally, statistical metadata can be provided for each indexed segment (e.g., the first and last time of occurrence of the segment, the mean inter-arrival time, and the standard deviation of the inter-arrival time).
  • During the course of the indexing process, it is possible that a single time bucket will be filled and committed to disk 560 several times. This will result in multiple, independently searchable indices in secondary storage for a single time span. In an exemplary implementation, there is a merging process 570 that takes as input two or more warm indices and merges them into a single warm index for that time bucket. This is a performance optimization and is not strictly required for searching.
  • Expiring Events
  • Furthermore, over a long period of time, it is possible that applying the indexing process 220 to time series data will cause a large amount of persistent data to accumulate. The indexing process, therefore, preferably contains an expiration process 580 that monitors the database for time buckets to be deleted based on user-provided preferences. In one implementation, these preferences might include a trailing time window (“events older than 3 months need not be returned in search results”), a time range (“events earlier than January 1 of this year need not be returned in search results”), a maximum number of events (“no more than 1 million events need be returned in search results”), or a maximum total size for the index (“return as many useful search results as possible while consuming no more than 100 GB of Disk”). A process periodically wakes up and tests the collection of warm slices for any slices that meet the expiration criterion. Upon expiration, a warm index file and its associated raw event data and segmentation is moved out of the active index. The index file need not necessarily be deleted. In one implementation, the index file could be streamed to less expensive offline storage.
  • Search Process
  • An example TSSE search process is shown in FIG. 6 and includes several methods for parsing 610 a search phrase, issuing multiple sub-searches 625 in order to satisfy the incoming parent search, using sub-search results 635 to prune searches, and merging 640 sub-search results into a final set of search results for presentation to the user.
  • Time Series Search Language
  • During search processing, incoming search phrases 255 are parsed 610 according to a time series search language (TSSL) in order to generate annotated parse trees 615. An exemplary TSSL language syntax includes a series of modifiers or commands taking the format name::value. Some modifiers may have default values and some can only be used once, while some can appear several times in the same search with different values. Examples include the following:
    • average::value—calculate the average number of events using the value time frame.
    • page::value—present search results by value. Value can be seconds, minutes, hours, days, weeks or months or any other metadata element, for example, source or event type.
    • count::—calculate the total number of for events.
    • daysago::value—search for events within the last value days.
    • index::value—the index to search-main, default, history, or another index defined by the TSSE.
    • hoursago::value—search for events within the last value hours.
    • eventtype::value—search for events with an event type or tag that matches the specified value.
    • host::value—search for events whose hostname was set to the specified value. This is the host that logged the event, not necessarily the host that generated the event.
    • maxresults::value—the maximum number ofresults to return. minutesago::value—search for events within the last value minutes.
    • related::value—search for events with segment values (e.g., 404 or username) matching one or more in the current event.
    • similar::value—search for events with a similar event type to the current event.
    • sourcetype::value—search for events with a given sourcetype of value.
    • unexpected:: value—search for events that lie outside observed patterns in the index by the specified value of 0 (expected) to 9 (most unexpected).
  • Modifiers can be combined with keywords, wildcard characters, literal strings, quoted phrases and Boolean operators, such as AND, OR, NOT. Parentheses can be used to nest search and sub-search phrases together. An example search phrase might be “sourcetype::mysql*sock*NOT (started OR (host::foo OR host::BAR)) maxresults::10 (eventtype::baddb OR eventtype::?8512-3) daysago::30”.
  • In one implementation, a custom parser 610 handles the Boolean operators “NOT” and “OR” and defaults to “AND”. This implementation also handles using parentheses to disambiguate the language when there are several operators. Otherwise, it associates left-to-right. The implementation also supports special search operators that are indicated using a domain specifier followed by a demarcation element. For example, searching for “source::1234”, might indicate that the searcher (human or system) wants to restrict results to events that were received from a particular source ID.
  • Incoming search phrases may also trigger ad hoc computation 612 based on a map of special keywords. For example, a special search string might be used to indicate that a search is to be stored and reissued on a periodic basis or to request a list of sources. In this case, the search string would be stored in a table on disk along with a schedule specifying the schedule on which the search should be reissued. Depending on the results of the search when executed, additional actions may be triggered. For example, an email alert might be sent, an RSS feed might be updated, or a user-supplied script might be executed. Another example of a search that triggers ad hoc computation 612 is one that is indicated to be saved for later use, but not to be reissued on a periodic basis.
  • Assuming that the search parser 610 determined that an annotated syntax tree 615 should be created for the search string, the next component, the search execution engine 620 will use the annotated syntax tree 615 to issue sub-searches 625 to the time bucketed indices 565. Each sub-search 625 is targeted at an individual time bucket 565. Time buckets are queried in the order that is most advantageous to pruning given the sort order for the results. For example, if search results are sorted in reverse chronological order, then the sub-search for the most recent time bucket will be issued first. This allows the search execution engine 620 to examine the results 635 of the sub-search before proceeding with additional (expensive) sub-searches 625. For example, if a particular sub-search returns enough results 635, then it is not necessary to proceed with additional sub-searches 625.
  • Once enough results sets 637 have been accumulated to satisfy the parent search, another module will take the results and merge 640 them into a single result set 235, 237 that satisfies the search. This merging process, in one implementation, performs a merge sort on the results from each of the buckets to keep them in the order required for the presentation process.
  • Presentation Process
  • The final process in an exemplary implementation of our example TSSE is the preparation of search results for presentation 240, as shown in FIG. 7. Unlike current large-scale search engines that present non-interactive results ordered by keyword relevance ranking, this example TSSE can present results organized by time, event relationships, and keyword relevance ranking.
  • Time Based Presentation
  • Unique to the challenge of indexing and searching time series data is the presentation of results using time as a primary dimension 710. Because existing large-scale search engines do not organize information by time, the presentation of time-based results is not a consideration. However, a primary benefit of a TSSE is the ability to index, search and present time series data chronologically. Results can be presented by aggregating and summarizing search results based on discrete time ranges or based on statistical calculations.
  • For example, the example TSSL can specify to see results for only a particular time frame and/or to see results presented by seconds, minutes, hours, days, weeks or months. In this way the search window can be limited to a timeframe and the results can be constructed for optimal viewing based on the density of the expected result set returned from a search. The search “192.168.169.100 hoursago::24 page:: seconds”, will return time series events including the keyword “192.168.169.100” that occurred within the last 24 hours and will summarize the display results by seconds. In an exemplary implementation of a TSSE, summarization can include both aggregated display lines summarizing the events for the summary window and/or paging the results by the summary window. In the example above, each page of the search results presentation may include one second in time. Examples include but are not limited to:
      • Ability to scroll/page through the data (n) results at a time by count.
      • Ability to scroll/page through the data by time: next/previous second, minute, hour, day, year.
      • Ability to specify max count per timeframe.
      • Ability to get next (n) results within a paged time frame—(within a second) get next 100.
    Metadata Presentation
  • In addition to time-based presentation 710, an example TSSE preferably is able to present additional aggregation and summarization of results by metadata characteristics 720, such as, data source, data source type, event type, or originating host machine. In this way, results can be not only organized by time, but also refined by metadata aggregation and summarization. The search “192.168.169.100 page::source” will present all the results with “192.168.169.100” and put each data source containing results on a separate page. Examples include but are not limited to:
      • Original physical location of the data source.
      • Original physical machine, sensor etc. generating the data.
      • Type of data source as dynamically assigned by the indexing process.
      • Type of event as dynamically assigned by the indexing process.
    Zoom Control
  • Because time and certain metadata parameters (e.g., machine IP addresses) can be continuous, an example TSSE user interaction model can include the ability to move from small increments of time (seconds or minutes) or metadata parameters (different classes of IP addresses) using a zoom control 730. This zoom control can be combined with other metadata search parameters to enable the rapid movement through large amounts of data. Examples include but are not limited to:
      • Ability to zoom in and out around a given time from any second(s) to minute(s), hour(s), etc.
      • Ability to zoom in to second resolution around 12:15 AM Jun. 3, 2005, for a specific data source type and physical machine location.
    Presentation Density Control
  • Given the different types of users (humans and systems) and the varying types of time series data and events (e.g., single line events a few bytes in size, to multiple line events several megabytes in size) it is useful to be able to specify the density of the results. In one implementation the presentation density can be controlled 740 to return and/or display only the raw data without any metadata in a simple ASCII text format. Alternatively the same results can be returned and or displayed with full metadata as rich XML.
  • Implementation
  • The TSSE can be implemented in many different ways. In one approach, each box shown in the various figures is implemented in software as a separate process. All of the processes can run on a single machine or they can be divided up to run on separate logical or physical machines. In alternate embodiments, the invention is implemented in computer hardware, firmware, software, and/or combinations thereof. Apparatus of the invention can be implemented in a computer program product tangibly embodied in a machine-readable storage device for execution by a programmable processor; and method steps of the invention can be performed by a programmable processor executing a program of instructions to perform functions of the invention by operating on input data and generating output. The invention can be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device. Each computer program can be implemented in a high-level procedural or object-oriented programming language, or in assembly or machine language if desired; and in any case, the language can be a compiled or interpreted language. Suitable processors include, by way of example, both general and special purpose microprocessors. Generally, a processor will receive instructions and data from a read-only memory and/or a random access memory. Generally, a computer will include one or more mass storage devices for storing data files; such devices include magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and optical disks. Storage devices suitable for tangibly embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM disks. Any of the foregoing can be supplemented by, or incorporated in, ASICs (application-specific integrated circuits) and other forms of hardware.
  • Therefore, although the detailed description contains many specifics, these should not be construed as limiting the scope of the invention but merely as illustrating different examples and aspects of the invention. It should be appreciated that the scope of the invention includes other embodiments not discussed in detail above. Various modifications, changes and variations which will be apparent to those skilled in the art may be made in the arrangement, operation and details of the method and apparatus of the present invention disclosed herein without departing from the spirit and scope of the invention as defined in the appended claims. Therefore, the scope of the invention should be determined by the appended claims and their legal equivalents.

Claims (30)

1. A computer-implemented method, comprising:
obtaining log data generated by at least one component in an information processing environment;
obtaining data that is not log data from a real-time monitoring environment;
storing the log data in a searchable time series data store;
storing the data obtained from the real-time monitoring environment in the searchable time series data store; and
executing a search on the log data and the data obtained from the real-time monitoring environment in the searchable time series data store.
2. The computer-implemented method of claim 1, wherein the data obtained from the real-time monitoring environment includes sensor data.
3. The computer-implemented method of claim 1, wherein the data obtained from the real-time monitoring environment includes measurement data.
4. The computer-implemented method of claim 1, wherein the data obtained from the real-time monitoring environment includes operational performance data.
5. The computer-implemented method of claim 1, wherein executing the search includes executing the search to find similar data.
6. The computer-implemented method of claim 1, wherein executing the search includes executing the search to find related data.
7. The computer-implemented method of claim 1, wherein executing the search includes executing the search to find within a defined time range both the log data and the data obtained from the real-time monitoring environment.
8. The computer-implemented method of claim 1, wherein executing the search includes executing the search over a defined time range.
9. The computer-implemented method of claim 1, wherein executing the search includes executing the search to look for a frequency of distribution.
10. The computer-implemented method of claim 1, wherein executing the search includes executing the search to look for a pattern of occurrence.
11. The computer-implemented method of claim 1, further comprising causing display of results of the search.
12. The computer-implemented method of claim 1, wherein executing the search includes executing the search to find within a defined time range both the log data and the data obtained from the real-time monitoring environment, and wherein The computer-implemented method further comprises causing display of results of the search.
13. The computer-implemented method of claim 1, further comprising providing results of the search through an application program interface (API).
14. The computer-implemented method of claim 1, wherein the log data comes from two or more sources.
15. The computer-implemented method of claim 1, wherein the data obtained from the real-time monitoring environment comes from two or more sources.
16. The computer-implemented method of claim 1, wherein at least some of the data obtained from the real-time monitoring environment is obtained synchronously.
17. The computer-implemented method of claim 1, wherein at least some of the data obtained from the real-time monitoring environment is obtained asynchronously.
18. The computer-implemented method of claim 1, wherein at least some of the data obtained from the real-time monitoring environment is obtained synchronously and at least some of the data obtained from the real-time monitoring environment is obtained asynchronously.
19. The computer-implemented method of claim 1, further comprising timestamping the log data prior to storing the log data in the searchable time series data store.
20. The computer-implemented method of claim 1, wherein the obtaining log data comprises collecting the log data at more than one physical location.
21. The computer-implemented method of claim 1, wherein storing the log data comprises:
aggregating the log data into events;
time stamping the events; and
storing the events in the searchable time series data store.
22. The computer-implemented method of claim 1, wherein storing the log data comprises:
aggregating the log data into events;
time stamping the events; and
storing the events in the searchable time series data store in chronological order based on the time stamping.
23. The computer-implemented method of claim 1, wherein storing the log data comprises:
aggregating the log data into events;
classifying the events by domain;
time stamping the events based on the domain; and
storing the events in the searchable time series data store.
24. The computer-implemented method of claim 1, wherein storing the log data comprises:
aggregating the log data into events using extraction to detect the beginning and ending of the events; and
storing the events in the searchable time series data store.
25. The computer-implemented method of claim 1, wherein storing the log data comprises:
aggregating the log data into events using machine learning to identify boundaries between events; and
storing the events in the searchable time series data store.
26. The computer-implemented method of claim 1, wherein storing the log data comprises:
aggregating the log data into events;
time stamping the events; and
combining a group of events into a hot index, which is not searchable and does not persist; and
converting the hot index into a warm index when the hot index is at capacity, the warm index being stored in the searchable time series data store.
27. A system comprising:
a memory; and
a processing device coupled with the memory to:
obtain log data generated by at least one component in an information processing environment,
obtain data from a real-time monitoring environment,
store the log data in a searchable time series data store,
store the data obtained from the real-time monitoring environment in the searchable time series data store, and
execute a search on the log data and the data obtained from the real-time monitoring environment in the searchable time series data store.
28. The system of claim 27, wherein to store the log data, the processing device is coupled with the memory to:
aggregate the log data into events,
time stamp the events, and
store the events in one or more time buckets in the searchable time series data store based on the time stamp.
29. A non-transitory computer-readable medium encoding instructions thereon that, in response to execution by one or more processing devices, cause the one or more processing devices to perform operations comprising:
obtaining log data generated by at least one component in an information processing environment;
obtaining data from a real-time monitoring environment;
storing the log data in a searchable time series data store;
storing the data obtained from the real-time monitoring environment in the searchable time series data store; and
executing a search on the log data and the data obtained from the real-time monitoring environment in the searchable time series data store.
30. The computer-readable readable medium of claim 29, wherein storing the log data comprises:
aggregating the log data into events;
time stamping the events; and
storing the events in the searchable time series data store.
US15/420,985 2006-10-05 2017-01-31 Unified time series search across both log data and data from a real-time monitoring environment Abandoned US20170139962A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US15/420,985 US20170139962A1 (en) 2006-10-05 2017-01-31 Unified time series search across both log data and data from a real-time monitoring environment

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US82828306P 2006-10-05 2006-10-05
US11/868,370 US8112425B2 (en) 2006-10-05 2007-10-05 Time series search engine
US13/353,135 US9002854B2 (en) 2006-10-05 2012-01-18 Time series search with interpolated time stamp
US14/611,170 US9594789B2 (en) 2006-10-05 2015-01-30 Time series search in primary and secondary memory
US15/420,985 US20170139962A1 (en) 2006-10-05 2017-01-31 Unified time series search across both log data and data from a real-time monitoring environment

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US14/611,170 Continuation US9594789B2 (en) 2006-10-05 2015-01-30 Time series search in primary and secondary memory

Publications (1)

Publication Number Publication Date
US20170139962A1 true US20170139962A1 (en) 2017-05-18

Family

ID=39269257

Family Applications (32)

Application Number Title Priority Date Filing Date
US11/868,370 Active 2029-06-11 US8112425B2 (en) 2006-10-05 2007-10-05 Time series search engine
US13/353,135 Active US9002854B2 (en) 2006-10-05 2012-01-18 Time series search with interpolated time stamp
US13/664,186 Active US8990184B2 (en) 2006-10-05 2012-10-30 Time series search engine
US14/611,170 Active US9594789B2 (en) 2006-10-05 2015-01-30 Time series search in primary and secondary memory
US14/815,980 Active US10678767B2 (en) 2006-10-05 2015-08-01 Search query processing using operational parameters
US14/929,248 Active 2027-11-02 US9922065B2 (en) 2006-10-05 2015-10-30 Determining timestamps to be associated with events in machine data
US15/007,176 Active US9514175B2 (en) 2006-10-05 2016-01-26 Normalization of time stamps for event data
US15/008,425 Active US10216779B2 (en) 2006-10-05 2016-01-27 Expiration of persistent data structures that satisfy search queries
US15/008,428 Active US9922066B2 (en) 2006-10-05 2016-01-27 Aggregation and display of search results from multi-criteria search queries on event data
US15/339,887 Active US10255312B2 (en) 2006-10-05 2016-10-31 Time stamp creation for event data
US15/339,953 Active 2027-10-27 US10262018B2 (en) 2006-10-05 2016-11-01 Application of search policies to searches on event data stored in persistent data structures
US15/421,068 Abandoned US20170139963A1 (en) 2006-10-05 2017-01-31 Query-initiated search across separate stores for log data and data from a real-time monitoring environment
US15/420,938 Active US9747316B2 (en) 2006-10-05 2017-01-31 Search based on a relationship between log data and data from a real-time monitoring environment
US15/421,416 Active US10242039B2 (en) 2006-10-05 2017-01-31 Source differentiation of machine data
US15/420,985 Abandoned US20170139962A1 (en) 2006-10-05 2017-01-31 Unified time series search across both log data and data from a real-time monitoring environment
US15/661,260 Active US9996571B2 (en) 2006-10-05 2017-07-27 Storing and executing a search on log data and data obtained from a real-time monitoring environment
US15/661,286 Active US9922067B2 (en) 2006-10-05 2017-07-27 Storing log data as events and performing a search on the log data and data obtained from a real-time monitoring environment
US15/661,268 Active US9928262B2 (en) 2006-10-05 2017-07-27 Log data time stamp extraction and search on log data real-time monitoring environment
US15/883,552 Active 2028-09-30 US10747742B2 (en) 2006-10-05 2018-01-30 Storing log data and performing a search on the log data and data that is not log data
US15/883,588 Active 2028-09-29 US10740313B2 (en) 2006-10-05 2018-01-30 Storing events associated with a time stamp extracted from log data and performing a search on the events and data that is not log data
US15/885,806 Active 2028-06-07 US10977233B2 (en) 2006-10-05 2018-01-31 Aggregating search results from a plurality of searches executed across time series data
US15/963,740 Active 2028-07-27 US10891281B2 (en) 2006-10-05 2018-04-26 Storing events derived from log data and performing a search on the events and data that is not log data
US16/264,618 Abandoned US20190171630A1 (en) 2006-10-05 2019-01-31 Determining Time Stamps in Machine Data Derived Events
US16/264,587 Active 2028-05-15 US11144526B2 (en) 2006-10-05 2019-01-31 Applying time-based search phrases across event data
US16/264,571 Abandoned US20190213180A1 (en) 2006-10-05 2019-01-31 Searching Packed Raw Data in Events Using Segment Lookup Data Structures
US16/264,610 Active 2028-06-08 US11249971B2 (en) 2006-10-05 2019-01-31 Segmenting machine data using token-based signatures
US16/779,552 Abandoned US20200174986A1 (en) 2006-10-05 2020-01-31 Time series search phrase processing
US17/125,807 Active US11561952B2 (en) 2006-10-05 2020-12-17 Storing events derived from log data and performing a search on the events and data that is not log data
US17/243,966 Active US11537585B2 (en) 2006-10-05 2021-04-29 Determining time stamps in machine data derived events
US17/243,967 Active US11550772B2 (en) 2006-10-05 2021-04-29 Time series search phrase processing
US17/589,818 Active US11526482B2 (en) 2006-10-05 2022-01-31 Determining timestamps to be associated with events in machine data
US18/069,914 Active US11947513B2 (en) 2006-10-05 2022-12-21 Search phrase processing

Family Applications Before (14)

Application Number Title Priority Date Filing Date
US11/868,370 Active 2029-06-11 US8112425B2 (en) 2006-10-05 2007-10-05 Time series search engine
US13/353,135 Active US9002854B2 (en) 2006-10-05 2012-01-18 Time series search with interpolated time stamp
US13/664,186 Active US8990184B2 (en) 2006-10-05 2012-10-30 Time series search engine
US14/611,170 Active US9594789B2 (en) 2006-10-05 2015-01-30 Time series search in primary and secondary memory
US14/815,980 Active US10678767B2 (en) 2006-10-05 2015-08-01 Search query processing using operational parameters
US14/929,248 Active 2027-11-02 US9922065B2 (en) 2006-10-05 2015-10-30 Determining timestamps to be associated with events in machine data
US15/007,176 Active US9514175B2 (en) 2006-10-05 2016-01-26 Normalization of time stamps for event data
US15/008,425 Active US10216779B2 (en) 2006-10-05 2016-01-27 Expiration of persistent data structures that satisfy search queries
US15/008,428 Active US9922066B2 (en) 2006-10-05 2016-01-27 Aggregation and display of search results from multi-criteria search queries on event data
US15/339,887 Active US10255312B2 (en) 2006-10-05 2016-10-31 Time stamp creation for event data
US15/339,953 Active 2027-10-27 US10262018B2 (en) 2006-10-05 2016-11-01 Application of search policies to searches on event data stored in persistent data structures
US15/421,068 Abandoned US20170139963A1 (en) 2006-10-05 2017-01-31 Query-initiated search across separate stores for log data and data from a real-time monitoring environment
US15/420,938 Active US9747316B2 (en) 2006-10-05 2017-01-31 Search based on a relationship between log data and data from a real-time monitoring environment
US15/421,416 Active US10242039B2 (en) 2006-10-05 2017-01-31 Source differentiation of machine data

Family Applications After (17)

Application Number Title Priority Date Filing Date
US15/661,260 Active US9996571B2 (en) 2006-10-05 2017-07-27 Storing and executing a search on log data and data obtained from a real-time monitoring environment
US15/661,286 Active US9922067B2 (en) 2006-10-05 2017-07-27 Storing log data as events and performing a search on the log data and data obtained from a real-time monitoring environment
US15/661,268 Active US9928262B2 (en) 2006-10-05 2017-07-27 Log data time stamp extraction and search on log data real-time monitoring environment
US15/883,552 Active 2028-09-30 US10747742B2 (en) 2006-10-05 2018-01-30 Storing log data and performing a search on the log data and data that is not log data
US15/883,588 Active 2028-09-29 US10740313B2 (en) 2006-10-05 2018-01-30 Storing events associated with a time stamp extracted from log data and performing a search on the events and data that is not log data
US15/885,806 Active 2028-06-07 US10977233B2 (en) 2006-10-05 2018-01-31 Aggregating search results from a plurality of searches executed across time series data
US15/963,740 Active 2028-07-27 US10891281B2 (en) 2006-10-05 2018-04-26 Storing events derived from log data and performing a search on the events and data that is not log data
US16/264,618 Abandoned US20190171630A1 (en) 2006-10-05 2019-01-31 Determining Time Stamps in Machine Data Derived Events
US16/264,587 Active 2028-05-15 US11144526B2 (en) 2006-10-05 2019-01-31 Applying time-based search phrases across event data
US16/264,571 Abandoned US20190213180A1 (en) 2006-10-05 2019-01-31 Searching Packed Raw Data in Events Using Segment Lookup Data Structures
US16/264,610 Active 2028-06-08 US11249971B2 (en) 2006-10-05 2019-01-31 Segmenting machine data using token-based signatures
US16/779,552 Abandoned US20200174986A1 (en) 2006-10-05 2020-01-31 Time series search phrase processing
US17/125,807 Active US11561952B2 (en) 2006-10-05 2020-12-17 Storing events derived from log data and performing a search on the events and data that is not log data
US17/243,966 Active US11537585B2 (en) 2006-10-05 2021-04-29 Determining time stamps in machine data derived events
US17/243,967 Active US11550772B2 (en) 2006-10-05 2021-04-29 Time series search phrase processing
US17/589,818 Active US11526482B2 (en) 2006-10-05 2022-01-31 Determining timestamps to be associated with events in machine data
US18/069,914 Active US11947513B2 (en) 2006-10-05 2022-12-21 Search phrase processing

Country Status (6)

Country Link
US (32) US8112425B2 (en)
EP (3) EP3493074A1 (en)
JP (1) JP5264740B2 (en)
KR (2) KR101266267B1 (en)
CN (2) CN101641674B (en)
WO (1) WO2008043082A2 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9747316B2 (en) 2006-10-05 2017-08-29 Splunk Inc. Search based on a relationship between log data and data from a real-time monitoring environment
US10019496B2 (en) 2013-04-30 2018-07-10 Splunk Inc. Processing of performance data and log data from an information technology environment by using diverse data stores
US10225136B2 (en) 2013-04-30 2019-03-05 Splunk Inc. Processing of log data and performance data obtained via an application programming interface (API)
US10318541B2 (en) 2013-04-30 2019-06-11 Splunk Inc. Correlating log data with performance measurements having a specified relationship to a threshold value
US10346357B2 (en) 2013-04-30 2019-07-09 Splunk Inc. Processing of performance data and structure data from an information technology environment
US10353957B2 (en) 2013-04-30 2019-07-16 Splunk Inc. Processing of performance data and raw log data from an information technology environment
US10614132B2 (en) 2013-04-30 2020-04-07 Splunk Inc. GUI-triggered processing of performance data and log data from an information technology environment
US10922298B2 (en) 2019-01-30 2021-02-16 Walmart Apollo, Llc System and method for indexing time-series-based data
US10997191B2 (en) 2013-04-30 2021-05-04 Splunk Inc. Query-triggered processing of performance data and log data from an information technology environment

Families Citing this family (752)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7937344B2 (en) 2005-07-25 2011-05-03 Splunk Inc. Machine data web
US8229902B2 (en) 2006-11-01 2012-07-24 Ab Initio Technology Llc Managing storage of individually accessible data units
US7885932B2 (en) 2006-11-01 2011-02-08 Ab Initio Technology Llc Managing storage of individually accessible data units
US8930331B2 (en) 2007-02-21 2015-01-06 Palantir Technologies Providing unique views of data based on changes or rules
US20090132581A1 (en) * 2007-05-29 2009-05-21 Christopher Ahlberg Information service for facts extracted from differing sources on a wide area network
US10698886B2 (en) 2007-08-14 2020-06-30 John Nicholas And Kristin Gross Trust U/A/D Temporal based online search and advertising
US8484115B2 (en) 2007-10-03 2013-07-09 Palantir Technologies, Inc. Object-oriented time series generator
JP5233233B2 (en) * 2007-10-05 2013-07-10 日本電気株式会社 Information search system, information search index registration device, information search method and program
US8145643B2 (en) * 2007-12-04 2012-03-27 Yahoo! Inc. Time based ordering of provided mobile content
US20090234806A1 (en) * 2008-03-13 2009-09-17 International Business Machines Corporation Displaying search results using software development process information
US10102091B2 (en) 2008-06-04 2018-10-16 Oracle International Corporation System and method for supporting a testing framework for an event processing system using multiple input event streams
US9892009B2 (en) 2008-06-04 2018-02-13 Oracle International Corporation System and method for supporting a sliding window for testing an event processing system
US9348499B2 (en) 2008-09-15 2016-05-24 Palantir Technologies, Inc. Sharing objects that rely on local resources with outside servers
US20100070426A1 (en) * 2008-09-15 2010-03-18 Palantir Technologies, Inc. Object modeling for exploring large data sets
JP4725622B2 (en) * 2008-09-22 2011-07-13 日本電気株式会社 Log management apparatus, system, method, and program
JP5465413B2 (en) * 2008-10-29 2014-04-09 株式会社日立製作所 Stream data processing method and system
US8732139B2 (en) 2008-12-18 2014-05-20 Sap Ag Method and system for dynamically partitioning very large database indices on write-once tables
US20100179851A1 (en) * 2009-01-12 2010-07-15 Oracle International Corporation Method and systems for generating enterprise plans using iterative disaggregation
US8468153B2 (en) * 2009-01-21 2013-06-18 Recorded Future, Inc. Information service for facts extracted from differing sources on a wide area network
US8219574B2 (en) * 2009-06-22 2012-07-10 Microsoft Corporation Querying compressed time-series signals
JP5712932B2 (en) * 2009-12-11 2015-05-07 日本電気株式会社 Information processing apparatus, information processing method, and program
US8527496B2 (en) * 2010-02-11 2013-09-03 Facebook, Inc. Real time content searching in social network
JP5423553B2 (en) * 2010-04-09 2014-02-19 株式会社日立製作所 Database management method, computer, sensor network system, and database search program
KR20130083829A (en) * 2010-04-30 2013-07-23 톰슨 라이센싱 Automatic image discovery and recommendation for displayed television content
KR100982034B1 (en) * 2010-05-11 2010-09-13 주식회사 이피아이솔루션즈 Monitoring method and system for database performance
US9298854B2 (en) * 2010-05-14 2016-03-29 Hitachi, Ltd. Time-series data management device, system, method, and program
CN102947829B (en) * 2010-06-18 2016-03-16 三菱电机株式会社 Data processing equipment and data processing method
US8558868B2 (en) * 2010-07-01 2013-10-15 Cisco Technology, Inc. Conference participant visualization
US8326836B1 (en) * 2010-07-13 2012-12-04 Google Inc. Providing time series information with search results
CN102347851B (en) * 2010-07-29 2014-04-09 阿里巴巴集团控股有限公司 Event processing method and server thereof
US8560520B2 (en) 2010-08-26 2013-10-15 Microsoft Corporation Information retrieval using time
US8306953B2 (en) * 2010-08-31 2012-11-06 International Business Machines Corporation Online management of historical data for efficient reporting and analytics
CN102419756A (en) * 2010-09-28 2012-04-18 腾讯科技(深圳)有限公司 Distributed data page turning method and system
JP5678620B2 (en) * 2010-12-03 2015-03-04 株式会社日立製作所 Data processing method, data processing system, and data processing apparatus
JP5713652B2 (en) * 2010-12-13 2015-05-07 キヤノン株式会社 Data search apparatus, method, and program
US20120239681A1 (en) 2011-03-14 2012-09-20 Splunk Inc. Scalable interactive display of distributed data
US9075830B2 (en) * 2011-03-24 2015-07-07 Morphism Llc Propagation through perdurance
US8566336B2 (en) 2011-03-30 2013-10-22 Splunk Inc. File identification management and tracking
US9547693B1 (en) 2011-06-23 2017-01-17 Palantir Technologies Inc. Periodic database search manager for multiple data sources
US9092482B2 (en) 2013-03-14 2015-07-28 Palantir Technologies, Inc. Fair scheduling for mixed-query loads
US8799240B2 (en) 2011-06-23 2014-08-05 Palantir Technologies, Inc. System and method for investigating large amounts of data
US8983912B1 (en) 2011-06-30 2015-03-17 Sumo Logic Data collection and transmission
US8862537B1 (en) 2011-06-30 2014-10-14 Sumo Logic Selective structure preserving obfuscation
US8930380B1 (en) 2011-06-30 2015-01-06 Sumo Logic Automatic parser generation
US9135560B1 (en) 2011-06-30 2015-09-15 Sumo Logic Automatic parser selection and usage
US20130013605A1 (en) * 2011-07-08 2013-01-10 Stanfill Craig W Managing Storage of Data for Range-Based Searching
US9280532B2 (en) 2011-08-02 2016-03-08 Palantir Technologies, Inc. System and method for accessing rich objects via spreadsheets
US9256361B2 (en) 2011-08-03 2016-02-09 Ebay Inc. Control of search results with multipoint pinch gestures
WO2013025556A1 (en) 2011-08-12 2013-02-21 Splunk Inc. Elastic scaling of data volume
US8732574B2 (en) 2011-08-25 2014-05-20 Palantir Technologies, Inc. System and method for parameterizing documents for automatic workflow generation
US8504542B2 (en) 2011-09-02 2013-08-06 Palantir Technologies, Inc. Multi-row transactions
US9081834B2 (en) * 2011-10-05 2015-07-14 Cumulus Systems Incorporated Process for gathering and special data structure for storing performance metric data
US9081829B2 (en) 2011-10-05 2015-07-14 Cumulus Systems Incorporated System for organizing and fast searching of massive amounts of data
US20130091266A1 (en) 2011-10-05 2013-04-11 Ajit Bhave System for organizing and fast searching of massive amounts of data
CN103049450B (en) * 2011-10-14 2018-01-23 深圳市世纪光速信息技术有限公司 One kind index establishes apparatus and method
US10860563B2 (en) * 2012-01-06 2020-12-08 Microsoft Technology Licensing, Llc Distributed database with modular blocks and associated log files
KR101426673B1 (en) * 2012-02-14 2014-08-05 주식회사 케이티 Method of Distributed Indexing and Searching for Effective Analysis of Time Series Data in Search System
CN103294720B (en) * 2012-03-01 2017-03-15 腾讯科技(北京)有限公司 A kind of network request event screening technique and device
US10311513B2 (en) * 2012-03-14 2019-06-04 Nasdaq Technology Ab Method and system for facilitating access to recorded data
CN103324642B (en) * 2012-03-23 2016-12-14 日电(中国)有限公司 System and method and the data query method of index is set up for data
US10908792B2 (en) * 2012-04-04 2021-02-02 Recorded Future, Inc. Interactive event-based information system
US8972415B2 (en) 2012-04-30 2015-03-03 Hewlett-Packard Development Company, L.P. Similarity search initialization
US10510116B2 (en) * 2012-05-02 2019-12-17 Imageworks Interactive System and method for modifying various types of assets
US11003687B2 (en) 2012-05-15 2021-05-11 Splunk, Inc. Executing data searches using generation identifiers
US8788459B2 (en) 2012-05-15 2014-07-22 Splunk Inc. Clustering for high availability and disaster recovery
US9130971B2 (en) 2012-05-15 2015-09-08 Splunk, Inc. Site-based search affinity
US10387448B2 (en) 2012-05-15 2019-08-20 Splunk Inc. Replication of summary data in a clustered computing environment
US8682886B2 (en) 2012-05-18 2014-03-25 Splunk Inc. Report acceleration using intermediate summaries of events
US10009391B1 (en) * 2012-05-31 2018-06-26 Leading Market Technologies, Inc. Apparatus and method for acquiring, managing, sharing, monitoring, analyzing and publishing web-based time series data
US9984125B1 (en) * 2012-05-31 2018-05-29 Leading Market Technologies, Inc. Apparatus and method for acquiring, managing, sharing, monitoring, analyzing and publishing web-based time series data
CN102737133B (en) * 2012-06-27 2016-02-17 北京城市网邻信息技术有限公司 A kind of method of real-time search
CN103577456B (en) 2012-07-31 2016-12-21 国际商业机器公司 For the method and apparatus processing time series data
US8825664B2 (en) * 2012-08-17 2014-09-02 Splunk Inc. Indexing preview
JP5880350B2 (en) * 2012-08-24 2016-03-09 富士ゼロックス株式会社 Information search program and information search apparatus
US9411327B2 (en) 2012-08-27 2016-08-09 Johnson Controls Technology Company Systems and methods for classifying data in building automation systems
US9589000B2 (en) 2012-08-30 2017-03-07 Atheer, Inc. Method and apparatus for content association and history tracking in virtual and augmented reality
US9594814B2 (en) 2012-09-07 2017-03-14 Splunk Inc. Advanced field extractor with modification of an extracted field
US9047181B2 (en) * 2012-09-07 2015-06-02 Splunk Inc. Visualization of data from clusters
US9753909B2 (en) 2012-09-07 2017-09-05 Splunk, Inc. Advanced field extractor with multiple positive examples
US20140208217A1 (en) * 2013-01-22 2014-07-24 Splunk Inc. Interface for managing splittable timestamps across event records
US8788525B2 (en) 2012-09-07 2014-07-22 Splunk Inc. Data model for machine data for semantic search
US8751499B1 (en) 2013-01-22 2014-06-10 Splunk Inc. Variable representative sampling under resource constraints
US10394946B2 (en) 2012-09-07 2019-08-27 Splunk Inc. Refining extraction rules based on selected text within events
US20150019537A1 (en) 2012-09-07 2015-01-15 Splunk Inc. Generating Reports from Unstructured Data
US8751963B1 (en) 2013-01-23 2014-06-10 Splunk Inc. Real time indication of previously extracted data fields for regular expressions
US8682906B1 (en) 2013-01-23 2014-03-25 Splunk Inc. Real time display of data field values based on manual editing of regular expressions
US9087105B2 (en) * 2012-10-04 2015-07-21 Adobe Systems Incorporated Rule-based extraction, transformation, and loading of data between disparate data sources
US9734237B2 (en) * 2012-10-08 2017-08-15 Bmc Software, Inc. Progressive analysis for big data
US9348677B2 (en) 2012-10-22 2016-05-24 Palantir Technologies Inc. System and method for batch evaluation programs
CN103794006B (en) 2012-10-31 2016-12-21 国际商业机器公司 For the method and apparatus processing the time series data of multiple sensor
CN103001796A (en) * 2012-11-13 2013-03-27 北界创想(北京)软件有限公司 Method and device for processing weblog data by server
US9183649B2 (en) * 2012-11-15 2015-11-10 International Business Machines Corporation Automatic tuning of value-series analysis tasks based on visual feedback
US9355479B2 (en) * 2012-11-15 2016-05-31 International Business Machines Corporation Automatic tuning of value-series analysis tasks based on visual feedback
US9355170B2 (en) * 2012-11-27 2016-05-31 Hewlett Packard Enterprise Development Lp Causal topic miner
US9747313B2 (en) * 2012-12-19 2017-08-29 Sap Se Timeline index for managing temporal data
US11201802B2 (en) 2012-12-31 2021-12-14 W.W. Grainger, Inc. Systems and methods for providing infrastructure metrics
US9152929B2 (en) 2013-01-23 2015-10-06 Splunk Inc. Real time display of statistics and values for selected regular expressions
US8909642B2 (en) 2013-01-23 2014-12-09 Splunk Inc. Automatic generation of a field-extraction rule based on selections in a sample event
US20150039625A1 (en) * 2013-02-14 2015-02-05 Loggly, Inc. Hierarchical Temporal Event Management
US8909656B2 (en) 2013-03-15 2014-12-09 Palantir Technologies Inc. Filter chains with associated multipath views for exploring large data sets
US10275778B1 (en) 2013-03-15 2019-04-30 Palantir Technologies Inc. Systems and user interfaces for dynamic and interactive investigation based on automatic malfeasance clustering of related data in various data structures
US8855999B1 (en) 2013-03-15 2014-10-07 Palantir Technologies Inc. Method and system for generating a parser and parsing complex data
US8868486B2 (en) 2013-03-15 2014-10-21 Palantir Technologies Inc. Time-sensitive cube
US9230280B1 (en) 2013-03-15 2016-01-05 Palantir Technologies Inc. Clustering data based on indications of financial malfeasance
US8930897B2 (en) 2013-03-15 2015-01-06 Palantir Technologies Inc. Data integration tool
WO2014149028A1 (en) * 2013-03-18 2014-09-25 Ge Intelligent Platforms, Inc. Apparatus and method for optimizing time series data storage
JP6060312B2 (en) * 2013-03-29 2017-01-11 キュムラス システムズ インコーポレイテッドCumulus Systems Incorporated Data organization and high-speed search
US9185007B2 (en) 2013-04-30 2015-11-10 Splunk Inc. Proactive monitoring tree with severity state sorting
US9495187B2 (en) 2013-04-30 2016-11-15 Splunk, Inc. Interactive, top-down presentation of the architecture and performance of a hypervisor environment
US20140324862A1 (en) * 2013-04-30 2014-10-30 Splunk Inc. Correlation for user-selected time ranges of values for performance metrics of components in an information-technology environment with log data from that information-technology environment
US9164786B2 (en) 2013-04-30 2015-10-20 Splunk Inc. Determining performance states of parent components in a virtual-machine environment based on performance states of related child components during a time period
US9142049B2 (en) 2013-04-30 2015-09-22 Splunk Inc. Proactive monitoring tree providing distribution stream chart with branch overlay
US8972992B2 (en) 2013-04-30 2015-03-03 Splunk Inc. Proactive monitoring tree with state distribution ring
US8904389B2 (en) 2013-04-30 2014-12-02 Splunk Inc. Determining performance states of components in a virtual machine environment based on performance states of related subcomponents
US9015716B2 (en) 2013-04-30 2015-04-21 Splunk Inc. Proactive monitoring tree with node pinning for concurrent node comparisons
US9916367B2 (en) 2013-05-03 2018-03-13 Splunk Inc. Processing system search requests from multiple data stores with overlapping data
US8738629B1 (en) 2013-05-03 2014-05-27 Splunk Inc. External Result Provided process for retrieving data stored using a different configuration or protocol
US9594545B2 (en) 2013-06-05 2017-03-14 Splunk Inc. System for displaying notification dependencies between component instances
US8756614B2 (en) 2013-06-05 2014-06-17 Splunk Inc. Central registry for binding features using dynamic pointers
US10061626B2 (en) 2013-06-05 2018-08-28 Splunk Inc. Application framework providing a registry for mapping names to component instances
US9715533B2 (en) 2013-07-02 2017-07-25 Ebay Inc. Multi-dimensional search
US10324942B2 (en) * 2013-07-26 2019-06-18 Snap Inc. Segment data visibility and management in a distributed database of time stamped records
US10574548B2 (en) 2013-07-31 2020-02-25 Splunk Inc. Key indicators view
US8752178B2 (en) 2013-07-31 2014-06-10 Splunk Inc. Blacklisting and whitelisting of security-related events
US9128965B1 (en) 2013-08-07 2015-09-08 Amazon Technologies, Inc. Configurable-capacity time-series tables
CN103399945A (en) * 2013-08-15 2013-11-20 成都博云科技有限公司 Data structure based on cloud computing database system
CA2928492A1 (en) * 2013-09-04 2015-03-12 Know Normal, Inc. Systems and methods for deriving, storing, and visualizing a numeric baseline for time-series numeric data which considers the time, coincidental events, and relevance of the datapoints as part of the derivation and visualization
US8938686B1 (en) 2013-10-03 2015-01-20 Palantir Technologies Inc. Systems and methods for analyzing performance of an entity
US10318970B2 (en) 2013-10-04 2019-06-11 International Business Machines Corporation Generating a succinct approximate representation of a time series
US10395198B2 (en) 2013-10-04 2019-08-27 International Business Machines Corporation Forecasting a time series based on actuals and a plan
US9116975B2 (en) 2013-10-18 2015-08-25 Palantir Technologies Inc. Systems and user interfaces for dynamic and interactive simultaneous querying of multiple data stores
WO2015065435A1 (en) * 2013-10-31 2015-05-07 Hewlett-Packard Development Company, L.P. Storing time series data for a search query
KR102126507B1 (en) * 2013-12-09 2020-06-24 삼성전자주식회사 Terminal, system and method of processing sensor data stream
US9105000B1 (en) 2013-12-10 2015-08-11 Palantir Technologies Inc. Aggregating data from a plurality of data sources
US20160305847A1 (en) * 2013-12-16 2016-10-20 Siemens Aktiengesellschaft Computer device for detecting correlations within data
US20150169583A1 (en) * 2013-12-18 2015-06-18 Attivio, Inc. Trending analysis for streams of documents
US10417258B2 (en) 2013-12-19 2019-09-17 Exposit Labs, Inc. Interactive multi-dimensional nested table supporting scalable real-time querying of large data volumes
EP2887236A1 (en) * 2013-12-23 2015-06-24 D square N.V. System and method for similarity search in process data
US9043696B1 (en) 2014-01-03 2015-05-26 Palantir Technologies Inc. Systems and methods for visual definition of data associations
US9660930B2 (en) 2014-03-17 2017-05-23 Splunk Inc. Dynamic data server nodes
US8990637B1 (en) 2014-03-17 2015-03-24 Splunk Inc. Computing and accessing quality indicators of computer applications
US9838346B2 (en) 2014-03-17 2017-12-05 Splunk Inc. Alerting on dual-queue systems
US9753818B2 (en) 2014-09-19 2017-09-05 Splunk Inc. Data forwarding using multiple data pipelines
US8935201B1 (en) 2014-03-18 2015-01-13 Palantir Technologies Inc. Determining and extracting changed data from a data source
JP6154542B2 (en) * 2014-03-26 2017-06-28 株式会社日立製作所 Time-series data management method and time-series data management system
US9762443B2 (en) 2014-04-15 2017-09-12 Splunk Inc. Transformation of network data at remote capture agents
US10462004B2 (en) 2014-04-15 2019-10-29 Splunk Inc. Visualizations of statistics associated with captured network data
US10523521B2 (en) * 2014-04-15 2019-12-31 Splunk Inc. Managing ephemeral event streams generated from captured network data
KR101565499B1 (en) * 2014-04-16 2015-11-03 주식회사 인포리언스 Data processing apparatus, data processing program, recording medium
US20150331856A1 (en) * 2014-04-30 2015-11-19 Cie Digital Labs, LLC Time-based content aggregator
US10423616B2 (en) 2014-04-30 2019-09-24 Hewlett Packard Enterprise Development Lp Using local memory nodes of a multicore machine to process a search query
US12028208B1 (en) 2014-05-09 2024-07-02 Splunk Inc. Selective event stream data storage based on network traffic volume
US9619557B2 (en) 2014-06-30 2017-04-11 Palantir Technologies, Inc. Systems and methods for key phrase characterization of documents
US9535974B1 (en) 2014-06-30 2017-01-03 Palantir Technologies Inc. Systems and methods for identifying key phrase clusters within documents
US20160147830A1 (en) 2014-07-09 2016-05-26 Splunk Inc. Managing datasets produced by alert-triggering search queries
WO2016008389A1 (en) * 2014-07-16 2016-01-21 谢成火 Method of quickly browsing history information and time period information query system
US20160019316A1 (en) 2014-07-21 2016-01-21 Splunk Inc. Wizard for creating a correlation search
US10019510B2 (en) * 2014-07-29 2018-07-10 Ca, Inc. Indexing and searching log records using templates index and attributes index
US10296616B2 (en) 2014-07-31 2019-05-21 Splunk Inc. Generation of a search query to approximate replication of a cluster of events
US9419992B2 (en) 2014-08-13 2016-08-16 Palantir Technologies Inc. Unwanted tunneling alert system
CN104239448A (en) * 2014-09-01 2014-12-24 北京优特捷信息技术有限公司 Time sequence data timestamp obtaining method and device
CN104216977A (en) * 2014-09-01 2014-12-17 北京优特捷信息技术有限公司 Time series data search method and device
CN104239477A (en) * 2014-09-03 2014-12-24 北京优特捷信息技术有限公司 Method and device for analyzing time series data
CN104239475A (en) * 2014-09-03 2014-12-24 北京优特捷信息技术有限公司 Method and device for analyzing time series data
US9454281B2 (en) 2014-09-03 2016-09-27 Palantir Technologies Inc. System for providing dynamic linked panels in user interface
US9785775B1 (en) * 2014-09-15 2017-10-10 Symantec Corporation Malware management
US10432497B2 (en) 2014-09-19 2019-10-01 Splunk Inc. Injecting custom classes in application code to facilitate network traffic monitoring
US9471362B2 (en) 2014-09-23 2016-10-18 Splunk Inc. Correlating hypervisor data for a virtual machine with associated operating system data
US9195674B1 (en) 2014-09-24 2015-11-24 Logzilla Corporation Systems and methods for large-scale system log analysis, deduplication and management
US9922099B2 (en) 2014-09-30 2018-03-20 Splunk Inc. Event limited field picker
US9990423B2 (en) 2014-09-30 2018-06-05 Splunk Inc. Hybrid cluster-based data intake and query
US10235460B2 (en) 2014-09-30 2019-03-19 Splunk Inc. Sharing configuration information for searches in data intake and query systems
US10261673B2 (en) * 2014-10-05 2019-04-16 Splunk Inc. Statistics value chart interface cell mode drill down
US11231840B1 (en) 2014-10-05 2022-01-25 Splunk Inc. Statistics chart row mode drill down
US11755559B1 (en) 2014-10-09 2023-09-12 Splunk Inc. Automatic entity control in a machine data driven service monitoring system
US9892166B2 (en) * 2014-10-09 2018-02-13 Ca, Inc. Partitioning log records based on term frequency and type for selective skipping during full-text searching
US11455590B2 (en) 2014-10-09 2022-09-27 Splunk Inc. Service monitoring adaptation for maintenance downtime
US11501238B2 (en) * 2014-10-09 2022-11-15 Splunk Inc. Per-entity breakdown of key performance indicators
US11671312B2 (en) * 2014-10-09 2023-06-06 Splunk Inc. Service detail monitoring console
US20160110478A1 (en) * 2014-10-17 2016-04-21 General Electric Company System and methods for quantization and featurization of time-series data
US10567557B2 (en) * 2014-10-31 2020-02-18 Splunk Inc. Automatically adjusting timestamps from remote systems based on time zone differences
US9229952B1 (en) 2014-11-05 2016-01-05 Palantir Technologies, Inc. History preserving data pipeline system and method
CN104408197B (en) * 2014-12-15 2017-12-19 北京国双科技有限公司 The processing method and processing device of thermodynamic chart
US10552994B2 (en) 2014-12-22 2020-02-04 Palantir Technologies Inc. Systems and interactive user interfaces for dynamic retrieval, analysis, and triage of data items
US10362133B1 (en) 2014-12-22 2019-07-23 Palantir Technologies Inc. Communication data processing architecture
US9348920B1 (en) 2014-12-22 2016-05-24 Palantir Technologies Inc. Concept indexing among database of documents using machine learning techniques
US10452651B1 (en) 2014-12-23 2019-10-22 Palantir Technologies Inc. Searching charts
US10037368B1 (en) * 2014-12-23 2018-07-31 VCE IP Holding Company LLC Methods, systems, and computer readable mediums for performing a free-form query
US9817563B1 (en) 2014-12-29 2017-11-14 Palantir Technologies Inc. System and method of generating data points from one or more data stores of data items for chart creation and manipulation
CN105808621B (en) * 2014-12-31 2019-06-04 北京奇虎科技有限公司 A kind of method and apparatus calculating response time search time
CN107111643B (en) * 2015-01-22 2018-12-28 三菱电机株式会社 Time series data retrieves device
US9916326B2 (en) 2015-01-27 2018-03-13 Splunk, Inc. Efficient point-in-polygon indexing technique for facilitating geofencing operations
US9836874B2 (en) 2015-01-27 2017-12-05 Splunk Inc. Efficient polygon-clipping technique to reduce data transfer requirements for a viewport
US10026204B2 (en) 2015-01-27 2018-07-17 Splunk Inc. Efficient point-in-polygon indexing technique for processing queries over geographic data sets
US9767122B2 (en) 2015-01-27 2017-09-19 Splunk Inc. Efficient point-in-polygon indexing technique to facilitate displaying geographic data
US9607414B2 (en) 2015-01-27 2017-03-28 Splunk Inc. Three-dimensional point-in-polygon operation to facilitate displaying three-dimensional structures
US9921733B2 (en) 2015-01-28 2018-03-20 Splunk Inc. Graphical interface for automatically binned information
US10013454B2 (en) 2015-01-30 2018-07-03 Splunk Inc. Text-based table manipulation of event data
US9916346B2 (en) 2015-01-30 2018-03-13 Splunk Inc. Interactive command entry list
US9842160B2 (en) 2015-01-30 2017-12-12 Splunk, Inc. Defining fields from particular occurences of field labels in events
US10061824B2 (en) 2015-01-30 2018-08-28 Splunk Inc. Cell-based table manipulation of event data
US10915583B2 (en) 2015-01-30 2021-02-09 Splunk Inc. Suggested field extraction
US11615073B2 (en) 2015-01-30 2023-03-28 Splunk Inc. Supplementing events displayed in a table format
US9922084B2 (en) 2015-01-30 2018-03-20 Splunk Inc. Events sets in a visually distinct display format
US9977803B2 (en) 2015-01-30 2018-05-22 Splunk Inc. Column-based table manipulation of event data
US11442924B2 (en) 2015-01-30 2022-09-13 Splunk Inc. Selective filtered summary graph
US10037331B2 (en) 2015-01-30 2018-07-31 Splunk Inc. Source type management
US10726037B2 (en) 2015-01-30 2020-07-28 Splunk Inc. Automatic field extraction from filed values
US10572863B2 (en) 2015-01-30 2020-02-25 Splunk Inc. Systems and methods for managing allocation of machine data storage
US11544248B2 (en) 2015-01-30 2023-01-03 Splunk Inc. Selective query loading across query interfaces
US9922114B2 (en) 2015-01-30 2018-03-20 Splunk Inc. Systems and methods for distributing indexer configurations
US9922082B2 (en) 2015-01-30 2018-03-20 Splunk Inc. Enforcing dependency between pipelines
US9922037B2 (en) 2015-01-30 2018-03-20 Splunk Inc. Index time, delimiter based extractions and previewing for use in indexing
US10152480B2 (en) 2015-01-31 2018-12-11 Splunk Inc. Archiving indexed data
CN105989048B (en) * 2015-02-05 2019-12-24 浙江大华技术股份有限公司 Data record processing method, device and system
US10255292B2 (en) * 2015-03-17 2019-04-09 Vmware, Inc. Virtual-log-file method and system
GB2536655B (en) * 2015-03-24 2017-09-20 Imagination Tech Ltd Logging events with timestamps
CN114153702A (en) 2015-04-03 2022-03-08 甲骨文国际公司 Method and system for implementing a log parser in a log analysis system
US11226975B2 (en) 2015-04-03 2022-01-18 Oracle International Corporation Method and system for implementing machine learning classifications
US11727025B2 (en) 2015-04-03 2023-08-15 Oracle International Corporation Method and system for implementing a log parser in a log analytics system
US10282455B2 (en) 2015-04-20 2019-05-07 Splunk Inc. Display of data ingestion information based on counting generated events
US10817544B2 (en) * 2015-04-20 2020-10-27 Splunk Inc. Scaling available storage based on counting generated events
US10049473B2 (en) 2015-04-27 2018-08-14 Splunk Inc Systems and methods for providing for third party visualizations
US10268755B2 (en) 2015-04-30 2019-04-23 Splunk Inc. Systems and methods for providing dynamic indexer discovery
US20160328432A1 (en) * 2015-05-06 2016-11-10 Squigglee LLC System and method for management of time series data sets
US9710258B2 (en) 2015-05-27 2017-07-18 International Business Machines Corporation Identifying cause of incidents in the DevOps environment automatically
US10339467B2 (en) 2015-06-02 2019-07-02 International Business Machines Corporation Quantitative discovery of name changes
US9672257B2 (en) * 2015-06-05 2017-06-06 Palantir Technologies Inc. Time-series data storage and processing database system
US9384203B1 (en) 2015-06-09 2016-07-05 Palantir Technologies Inc. Systems and methods for indexing and aggregating data records
US9875281B2 (en) 2015-06-10 2018-01-23 International Business Machines Corporation Performance monitoring of wait times
US10866994B2 (en) 2015-06-23 2020-12-15 Splunk Inc. Systems and methods for instant crawling, curation of data sources, and enabling ad-hoc search
US9407652B1 (en) 2015-06-26 2016-08-02 Palantir Technologies Inc. Network anomaly detection
CN105005617B (en) * 2015-07-21 2018-10-12 领航动力信息系统有限公司 A kind of storage method and device of time series data
US9996595B2 (en) 2015-08-03 2018-06-12 Palantir Technologies, Inc. Providing full data provenance visualization for versioned datasets
WO2017030550A1 (en) * 2015-08-17 2017-02-23 Hewlett Packard Enterprise Development Lp Confidence indicator of unreceived security message
JP2018523862A (en) * 2015-08-18 2018-08-23 グーグル エルエルシー Time series explorer
US9537880B1 (en) 2015-08-19 2017-01-03 Palantir Technologies Inc. Anomalous network monitoring, user behavior detection and database system
US10402385B1 (en) 2015-08-27 2019-09-03 Palantir Technologies Inc. Database live reindex
US10049140B2 (en) * 2015-08-28 2018-08-14 International Business Machines Corporation Encoding system, method, and recording medium for time grams
US10311042B1 (en) 2015-08-31 2019-06-04 Commvault Systems, Inc. Organically managing primary and secondary storage of a data object based on expiry timeframe supplied by a user of the data object
US9454564B1 (en) 2015-09-09 2016-09-27 Palantir Technologies Inc. Data integrity checks
US9576015B1 (en) 2015-09-09 2017-02-21 Palantir Technologies, Inc. Domain-specific language for dataset transformations
US10459819B2 (en) 2015-09-21 2019-10-29 Splunk Inc. Circular timeline displays of timestamped event data
US10536356B2 (en) 2015-09-21 2020-01-14 Splunk Inc. Generating and displaying topology map time-lapses of cloud computing resources
US10692533B2 (en) 2015-10-07 2020-06-23 Google Llc Storyline experience
US10044745B1 (en) 2015-10-12 2018-08-07 Palantir Technologies, Inc. Systems for computer network security risk assessment including user compromise analysis associated with a network of devices
US10534326B2 (en) 2015-10-21 2020-01-14 Johnson Controls Technology Company Building automation system with integrated building information model
KR101656077B1 (en) * 2015-10-28 2016-09-08 주식회사 인피니플럭스 System and method for time base partitioning using implicit time column value
US10572557B2 (en) * 2015-11-04 2020-02-25 International Business Machines Corporation Detecting relevant facets by leveraging diagram identification, soical media and statistical analysis software
US10685043B2 (en) 2015-11-10 2020-06-16 International Business Machines Corporation Event analysis in network management event streams
US9542446B1 (en) 2015-12-17 2017-01-10 Palantir Technologies, Inc. Automatic generation of composite datasets based on hierarchical fields
KR102512828B1 (en) * 2016-01-22 2023-03-22 삼성전자주식회사 Event signal processing method and apparatus
US11268732B2 (en) 2016-01-22 2022-03-08 Johnson Controls Technology Company Building energy management system with energy analytics
US11947785B2 (en) 2016-01-22 2024-04-02 Johnson Controls Technology Company Building system with a building graph
US10235431B2 (en) 2016-01-29 2019-03-19 Splunk Inc. Optimizing index file sizes based on indexed data storage conditions
US10713314B2 (en) 2016-01-29 2020-07-14 Splunk Inc. Facilitating data model acceleration in association with an external data system
US10534791B1 (en) * 2016-01-31 2020-01-14 Splunk Inc. Analysis of tokenized HTTP event collector
US10169434B1 (en) 2016-01-31 2019-01-01 Splunk Inc. Tokenized HTTP event collector
US10394802B1 (en) 2016-01-31 2019-08-27 Splunk, Inc. Interactive location queries for raw machine data
US10248621B2 (en) * 2016-02-09 2019-04-02 Moonshadow Mobile, Inc. Systems and methods for storing, updating, searching, and filtering time-series datasets
US9892020B1 (en) 2016-03-11 2018-02-13 Signalfx, Inc. User interface for specifying data stream processing language programs for analyzing instrumented software
US10824951B2 (en) * 2016-03-14 2020-11-03 Huawei Technologies Co., Ltd. System and method for rule generation using data processed by a binary classifier
US20170277997A1 (en) * 2016-03-23 2017-09-28 Nec Laboratories America, Inc. Invariants Modeling and Detection for Heterogeneous Logs
US10796235B2 (en) 2016-03-25 2020-10-06 Uptake Technologies, Inc. Computer systems and methods for providing a visualization of asset event and signal data
US11768004B2 (en) 2016-03-31 2023-09-26 Johnson Controls Tyco IP Holdings LLP HVAC device registration in a distributed building management system
WO2017180153A1 (en) * 2016-04-15 2017-10-19 Entit Software Llc Removing wildcard tokens from a set of wildcard tokens for a search query
US10496605B2 (en) 2016-04-29 2019-12-03 Splunk Inc. Application deployment for data intake and query system
US10505756B2 (en) 2017-02-10 2019-12-10 Johnson Controls Technology Company Building management system with space graphs
US11774920B2 (en) 2016-05-04 2023-10-03 Johnson Controls Technology Company Building system with user presentation composition based on building context
US10417451B2 (en) 2017-09-27 2019-09-17 Johnson Controls Technology Company Building system with smart entity personal identifying information (PII) masking
US10628079B1 (en) * 2016-05-27 2020-04-21 EMC IP Holding Company LLC Data caching for time-series analysis application
US10095747B1 (en) * 2016-06-06 2018-10-09 @Legal Discovery LLC Similar document identification using artificial intelligence
US10387495B2 (en) * 2016-06-10 2019-08-20 Palo Alto Research Center Incorporated System and method for efficient interval search using locality-preserving hashing
CN107491458B (en) * 2016-06-13 2021-08-31 阿里巴巴集团控股有限公司 Method, device and system for storing time series data
US10007674B2 (en) 2016-06-13 2018-06-26 Palantir Technologies Inc. Data revision control in large-scale data analytic systems
US10678802B2 (en) * 2016-06-15 2020-06-09 International Business Machines Corporation Log management utilizing time context indexing
US10146609B1 (en) 2016-07-08 2018-12-04 Splunk Inc. Configuration of continuous anomaly detection service
US10412148B2 (en) * 2016-07-21 2019-09-10 Microsoft Technology Licensing, Llc Systems and methods for event aggregation to reduce network bandwidth and improve network and activity feed server efficiency
US11227208B2 (en) 2016-07-29 2022-01-18 Splunk Inc. Automated data-generation for event-based system
US10460255B2 (en) 2016-07-29 2019-10-29 Splunk Inc. Machine learning in edge analytics
US10536351B2 (en) * 2016-07-29 2020-01-14 Splunk Inc. Analytics for edge devices
US11314799B2 (en) 2016-07-29 2022-04-26 Splunk Inc. Event-based data intake and query system employing non-text machine data
US10552728B2 (en) 2016-07-29 2020-02-04 Splunk Inc. Automated anomaly detection for event-based system
US10956481B2 (en) 2016-07-29 2021-03-23 Splunk Inc. Event-based correlation of non-text machine data
US11087236B2 (en) 2016-07-29 2021-08-10 Splunk Inc. Transmitting machine learning models to edge devices for edge analytics
US9753935B1 (en) 2016-08-02 2017-09-05 Palantir Technologies Inc. Time-series data storage and processing database system
CN106294792B (en) * 2016-08-15 2019-05-31 上海携程商务有限公司 The method for building up of correlation inquiry system and establish system
US20180052858A1 (en) * 2016-08-16 2018-02-22 Netscout Systems Texas, Llc Methods and procedures for timestamp-based indexing of items in real-time storage
US10482084B2 (en) 2016-08-16 2019-11-19 Netscout Systems Texas, Llc Optimized merge-sorting of data retrieved from parallel storage units
US10713252B1 (en) * 2016-08-29 2020-07-14 EMC IP Holding Company LLC Methods, systems, and computer readable mediums for performing an aggregated free-form query
US11874691B1 (en) 2016-09-26 2024-01-16 Splunk Inc. Managing efficient query execution including mapping of buckets to search nodes
US11416528B2 (en) 2016-09-26 2022-08-16 Splunk Inc. Query acceleration data store
US11126632B2 (en) 2016-09-26 2021-09-21 Splunk Inc. Subquery generation based on search configuration data from an external data system
US11232100B2 (en) 2016-09-26 2022-01-25 Splunk Inc. Resource allocation for multiple datasets
US10726009B2 (en) 2016-09-26 2020-07-28 Splunk Inc. Query processing using query-resource usage and node utilization data
US11093476B1 (en) * 2016-09-26 2021-08-17 Splunk Inc. HTTP events with custom fields
US11003714B1 (en) 2016-09-26 2021-05-11 Splunk Inc. Search node and bucket identification using a search node catalog and a data store catalog
US11157498B1 (en) 2016-09-26 2021-10-26 Splunk Inc. Query generation using a dataset association record of a metadata catalog
US11442935B2 (en) 2016-09-26 2022-09-13 Splunk Inc. Determining a record generation estimate of a processing task
US20180089324A1 (en) * 2016-09-26 2018-03-29 Splunk Inc. Dynamic resource allocation for real-time search
US11106734B1 (en) 2016-09-26 2021-08-31 Splunk Inc. Query execution using containerized state-free search nodes in a containerized scalable environment
US10909140B2 (en) 2016-09-26 2021-02-02 Splunk Inc. Clustering events based on extraction rules
US11615104B2 (en) 2016-09-26 2023-03-28 Splunk Inc. Subquery generation based on a data ingest estimate of an external data system
US11243963B2 (en) 2016-09-26 2022-02-08 Splunk Inc. Distributing partial results to worker nodes from an external data system
US11222066B1 (en) 2016-09-26 2022-01-11 Splunk Inc. Processing data using containerized state-free indexing nodes in a containerized scalable environment
US11461334B2 (en) 2016-09-26 2022-10-04 Splunk Inc. Data conditioning for dataset destination
US11023463B2 (en) 2016-09-26 2021-06-01 Splunk Inc. Converting and modifying a subquery for an external data system
US11250371B2 (en) 2016-09-26 2022-02-15 Splunk Inc. Managing process analytics across process components
US10353965B2 (en) * 2016-09-26 2019-07-16 Splunk Inc. Data fabric service system architecture
US11200246B2 (en) 2016-09-26 2021-12-14 Splunk Inc. Hash bucketing of data
US11281706B2 (en) 2016-09-26 2022-03-22 Splunk Inc. Multi-layer partition allocation for query execution
US11604795B2 (en) 2016-09-26 2023-03-14 Splunk Inc. Distributing partial results from an external data system between worker nodes
US11250056B1 (en) 2016-09-26 2022-02-15 Splunk Inc. Updating a location marker of an ingestion buffer based on storing buckets in a shared storage system
US11620336B1 (en) 2016-09-26 2023-04-04 Splunk Inc. Managing and storing buckets to a remote shared storage system based on a collective bucket size
US11314753B2 (en) 2016-09-26 2022-04-26 Splunk Inc. Execution of a query received from a data intake and query system
US11269939B1 (en) 2016-09-26 2022-03-08 Splunk Inc. Iterative message-based data processing including streaming analytics
US11550847B1 (en) 2016-09-26 2023-01-10 Splunk Inc. Hashing bucket identifiers to identify search nodes for efficient query execution
US11580107B2 (en) 2016-09-26 2023-02-14 Splunk Inc. Bucket data distribution for exporting data to worker nodes
US11321321B2 (en) 2016-09-26 2022-05-03 Splunk Inc. Record expansion and reduction based on a processing task in a data intake and query system
US10984044B1 (en) 2016-09-26 2021-04-20 Splunk Inc. Identifying buckets for query execution using a catalog of buckets stored in a remote shared storage system
US11562023B1 (en) 2016-09-26 2023-01-24 Splunk Inc. Merging buckets in a data intake and query system
US11586627B2 (en) 2016-09-26 2023-02-21 Splunk Inc. Partitioning and reducing records at ingest of a worker node
US10776355B1 (en) 2016-09-26 2020-09-15 Splunk Inc. Managing, storing, and caching query results and partial query results for combination with additional query results
US11860940B1 (en) 2016-09-26 2024-01-02 Splunk Inc. Identifying buckets for query execution using a catalog of buckets
US10956415B2 (en) 2016-09-26 2021-03-23 Splunk Inc. Generating a subquery for an external data system using a configuration file
US10977260B2 (en) 2016-09-26 2021-04-13 Splunk Inc. Task distribution in an execution node of a distributed execution environment
US11593377B2 (en) 2016-09-26 2023-02-28 Splunk Inc. Assigning processing tasks in a data intake and query system
US11599541B2 (en) 2016-09-26 2023-03-07 Splunk Inc. Determining records generated by a processing task of a query
US11163758B2 (en) 2016-09-26 2021-11-02 Splunk Inc. External dataset capability compensation
US10685279B2 (en) 2016-09-26 2020-06-16 Splunk Inc. Automatically generating field extraction recommendations
US10795884B2 (en) 2016-09-26 2020-10-06 Splunk Inc. Dynamic resource allocation for common storage query
US11294941B1 (en) 2016-09-26 2022-04-05 Splunk Inc. Message-based data ingestion to a data intake and query system
US12013895B2 (en) 2016-09-26 2024-06-18 Splunk Inc. Processing data using containerized nodes in a containerized scalable environment
US11567993B1 (en) 2016-09-26 2023-01-31 Splunk Inc. Copying buckets from a remote shared storage system to memory associated with a search node for query execution
US11663227B2 (en) 2016-09-26 2023-05-30 Splunk Inc. Generating a subquery for a distinct data intake and query system
US11093564B1 (en) 2016-09-26 2021-08-17 Splunk Inc. Identifying configuration parameters for a query using a metadata catalog
US10133588B1 (en) 2016-10-20 2018-11-20 Palantir Technologies Inc. Transforming instructions for collaborative updates
US10795887B1 (en) 2016-11-06 2020-10-06 Tableau Software, Inc. Dynamic rebuilding of query execution trees and reselection of query execution operators
US10318630B1 (en) 2016-11-21 2019-06-11 Palantir Technologies Inc. Analysis of large bodies of textual data
US10884875B2 (en) 2016-12-15 2021-01-05 Palantir Technologies Inc. Incremental backup of computer data files
US10223099B2 (en) 2016-12-21 2019-03-05 Palantir Technologies Inc. Systems and methods for peer-to-peer build sharing
CN108241721B (en) * 2016-12-27 2020-07-31 北京国双科技有限公司 Keyword-based article sorting method and device
US10684033B2 (en) 2017-01-06 2020-06-16 Johnson Controls Technology Company HVAC system with automated device pairing
US10067876B2 (en) 2017-01-09 2018-09-04 Splunk, Inc. Pre-fetching data from buckets in remote storage for a cache
CN107092628B (en) * 2017-01-10 2020-12-04 口碑控股有限公司 Time series data processing method and device
US11038905B2 (en) 2017-01-25 2021-06-15 Splunk, Inc. Identifying attack behavior based on scripting language activity
US11379530B2 (en) 2017-01-31 2022-07-05 Splunk Inc. Leveraging references values in inverted indexes to retrieve associated event records comprising raw machine data
US10375098B2 (en) 2017-01-31 2019-08-06 Splunk Inc. Anomaly detection based on relationships between multiple time series
US10474674B2 (en) 2017-01-31 2019-11-12 Splunk Inc. Using an inverted index in a pipelined search query to determine a set of event data that is further limited by filtering and/or processing of subsequent query pipestages
US11194794B2 (en) * 2017-01-31 2021-12-07 Splunk Inc. Search input recommendations
US11900287B2 (en) 2017-05-25 2024-02-13 Johnson Controls Tyco IP Holdings LLP Model predictive maintenance system with budgetary constraints
US10854194B2 (en) 2017-02-10 2020-12-01 Johnson Controls Technology Company Building system with digital twin based data ingestion and processing
US11360447B2 (en) 2017-02-10 2022-06-14 Johnson Controls Technology Company Building smart entity system with agent based communication and control
US10095756B2 (en) * 2017-02-10 2018-10-09 Johnson Controls Technology Company Building management system with declarative views of timeseries data
US20190095518A1 (en) 2017-09-27 2019-03-28 Johnson Controls Technology Company Web services for smart entity creation and maintenance using time series data
US11307538B2 (en) 2017-02-10 2022-04-19 Johnson Controls Technology Company Web services platform with cloud-eased feedback control
US11994833B2 (en) 2017-02-10 2024-05-28 Johnson Controls Technology Company Building smart entity system with agent based data ingestion and entity creation using time series data
US10452043B2 (en) 2017-02-10 2019-10-22 Johnson Controls Technology Company Building management system with nested stream generation
US11764991B2 (en) 2017-02-10 2023-09-19 Johnson Controls Technology Company Building management system with identity management
US10515098B2 (en) 2017-02-10 2019-12-24 Johnson Controls Technology Company Building management smart entity creation and maintenance using time series data
US10678216B2 (en) * 2017-02-28 2020-06-09 Sap Se Manufacturing process data collection and analytics
US10353379B2 (en) 2017-02-28 2019-07-16 Sap Se Manufacturing process data collection and analytics
US11042144B2 (en) 2017-03-24 2021-06-22 Johnson Controls Technology Company Building management system with dynamic channel communication
US11106713B2 (en) 2017-04-05 2021-08-31 Splunk Inc. Sampling data using inverted indexes in response to grouping selection
US10853399B2 (en) 2017-04-05 2020-12-01 Splunk Inc. User interface search tool for locating and summarizing data
US11061918B2 (en) 2017-04-05 2021-07-13 Splunk Inc. Locating and categorizing data using inverted indexes
US11102225B2 (en) 2017-04-17 2021-08-24 Splunk Inc. Detecting fraud by correlating user behavior biometrics with other data sources
US11315010B2 (en) 2017-04-17 2022-04-26 Splunk Inc. Neural networks for detecting fraud based on user behavior biometrics
US11372956B2 (en) 2017-04-17 2022-06-28 Splunk Inc. Multiple input neural networks for detecting fraud
US11327737B2 (en) 2017-04-21 2022-05-10 Johnson Controls Tyco IP Holdings LLP Building management system with cloud management of gateway configurations
US10698895B2 (en) 2017-04-21 2020-06-30 Splunk Inc. Skewing of scheduled search queries
US10838931B1 (en) * 2017-04-28 2020-11-17 EMC IP Holding Company LLC Use of stream-oriented log data structure for full-text search oriented inverted index metadata
US11074283B2 (en) 2017-04-28 2021-07-27 Splunk Inc. Linking data set summarizations using affinities
US10643214B2 (en) 2017-04-28 2020-05-05 Splunk Inc. Risk monitoring system
US10891284B2 (en) 2017-04-28 2021-01-12 Splunk Inc. Computing and replicating event deltas for mutable events in a distributed system
US11003691B2 (en) 2017-04-28 2021-05-11 Splunk Inc. Determining affinities for data set summarizations
US10509794B2 (en) 2017-04-28 2019-12-17 Splunk Inc. Dynamically-generated files for visualization sharing
US10497019B2 (en) 2017-04-28 2019-12-03 Splunk Inc. Geographic positions of mobile devices and external data sources correlation
US10788229B2 (en) 2017-05-10 2020-09-29 Johnson Controls Technology Company Building management system with a distributed blockchain database
US10896097B1 (en) 2017-05-25 2021-01-19 Palantir Technologies Inc. Approaches for backup and restoration of integrated databases
GB201708818D0 (en) 2017-06-02 2017-07-19 Palantir Technologies Inc Systems and methods for retrieving and processing data
US11022947B2 (en) 2017-06-07 2021-06-01 Johnson Controls Technology Company Building energy optimization system with economic load demand response (ELDR) optimization and ELDR user interfaces
US10956406B2 (en) 2017-06-12 2021-03-23 Palantir Technologies Inc. Propagated deletion of database records and derived data
WO2018232147A1 (en) 2017-06-15 2018-12-20 Johnson Controls Technology Company Building management system with artificial intelligence for unified agent based control of building subsystems
CA3067326A1 (en) * 2017-06-19 2018-12-27 Equifax Inc. Machine-learning system for servicing queries for digital content
WO2018233823A1 (en) * 2017-06-21 2018-12-27 Huawei Technologies Co., Ltd. Systems and methods for data aging based management of data entries
US11055284B1 (en) * 2017-06-30 2021-07-06 Tableau Software, Inc. Optimizing domain queries for relational databases
US11086943B2 (en) * 2017-07-17 2021-08-10 Ebay Inc. Bucket based distributed search system
WO2019018304A1 (en) 2017-07-17 2019-01-24 Johnson Controls Technology Company Systems and methods for agent based building simulation for optimal control
EP3655824A1 (en) 2017-07-21 2020-05-27 Johnson Controls Technology Company Building management system with dynamic work order generation with adaptive diagnostic task details
US10619882B2 (en) 2017-07-27 2020-04-14 Johnson Controls Technology Company Building management system with scorecard for building energy and equipment performance
US10565196B2 (en) 2017-07-29 2020-02-18 Splunk Inc. Determining a user-specific approach for disambiguation based on an interaction recommendation machine learning model
US11334552B2 (en) 2017-07-31 2022-05-17 Palantir Technologies Inc. Lightweight redundancy tool for performing transactions
US11921672B2 (en) 2017-07-31 2024-03-05 Splunk Inc. Query execution at a remote heterogeneous data store of a data fabric service
US11989194B2 (en) 2017-07-31 2024-05-21 Splunk Inc. Addressing memory limits for partition tracking among worker nodes
US12118009B2 (en) 2017-07-31 2024-10-15 Splunk Inc. Supporting query languages through distributed execution of query engines
US10417224B2 (en) 2017-08-14 2019-09-17 Palantir Technologies Inc. Time series database processing system
US10970648B2 (en) 2017-08-30 2021-04-06 International Business Machines Corporation Machine learning for time series using semantic and time series data
US11836198B2 (en) * 2017-09-05 2023-12-05 Drexel University Action-centered information retrieval
US10216695B1 (en) 2017-09-21 2019-02-26 Palantir Technologies Inc. Database system for time series data storage, processing, and analysis
US11106442B1 (en) 2017-09-23 2021-08-31 Splunk Inc. Information technology networked entity monitoring with metric selection prior to deployment
US11093518B1 (en) 2017-09-23 2021-08-17 Splunk Inc. Information technology networked entity monitoring with dynamic metric and threshold selection
US10678804B2 (en) 2017-09-25 2020-06-09 Splunk Inc. Cross-system journey monitoring based on relation of machine data
US10860618B2 (en) 2017-09-25 2020-12-08 Splunk Inc. Low-latency streaming analytics
US11159397B2 (en) 2017-09-25 2021-10-26 Splunk Inc. Lower-tier application deployment for higher-tier system data monitoring
US10693758B2 (en) 2017-09-25 2020-06-23 Splunk Inc. Collaborative incident management for networked computing systems
US10855793B2 (en) 2017-09-25 2020-12-01 Splunk Inc. Proxying hypertext transfer protocol (HTTP) requests for microservices
US11003682B2 (en) 2017-09-25 2021-05-11 Splunk Inc. Metrics analysis workflow
US11151137B2 (en) 2017-09-25 2021-10-19 Splunk Inc. Multi-partition operation in combination operations
US11469974B1 (en) 2017-09-25 2022-10-11 Splunk Inc. Analytics for edge devices to intelligently throttle data reporting
US10896182B2 (en) 2017-09-25 2021-01-19 Splunk Inc. Multi-partitioning determination for combination operations
US10853124B2 (en) 2017-09-25 2020-12-01 Splunk Inc. Managing user data in a multitenant deployment
US10771486B2 (en) * 2017-09-25 2020-09-08 Splunk Inc. Systems and methods for detecting network security threat event patterns
US10887369B2 (en) 2017-09-25 2021-01-05 Splunk Inc. Customizable load balancing in a user behavior analytics deployment
US11314788B2 (en) 2017-09-27 2022-04-26 Johnson Controls Tyco IP Holdings LLP Smart entity management for building management systems
US10962945B2 (en) 2017-09-27 2021-03-30 Johnson Controls Technology Company Building management system with integration of data into smart entities
US11258683B2 (en) 2017-09-27 2022-02-22 Johnson Controls Tyco IP Holdings LLP Web services platform with nested stream generation
US11195401B2 (en) 2017-09-27 2021-12-07 Johnson Controls Tyco IP Holdings LLP Building risk analysis system with natural language processing for threat ingestion
CN111566627B (en) 2017-11-09 2023-07-21 日本电信电话株式会社 Information accumulation device, data processing system, and recording medium
US11354301B2 (en) 2017-11-13 2022-06-07 LendingClub Bank, National Association Multi-system operation audit log
US10809682B2 (en) 2017-11-15 2020-10-20 Johnson Controls Technology Company Building management system with optimized processing of building system data
US11481383B2 (en) * 2017-11-15 2022-10-25 Sumo Logic, Inc. Key name synthesis
US11281169B2 (en) 2017-11-15 2022-03-22 Johnson Controls Tyco IP Holdings LLP Building management system with point virtualization for online meters
US11182434B2 (en) 2017-11-15 2021-11-23 Sumo Logic, Inc. Cardinality of time series
US11127235B2 (en) 2017-11-22 2021-09-21 Johnson Controls Tyco IP Holdings LLP Building campus with integrated smart environment
US11281726B2 (en) 2017-12-01 2022-03-22 Palantir Technologies Inc. System and methods for faster processor comparisons of visual graph features
US10614069B2 (en) 2017-12-01 2020-04-07 Palantir Technologies Inc. Workflow driven database partitioning
US11016986B2 (en) 2017-12-04 2021-05-25 Palantir Technologies Inc. Query-based time-series data display and processing system
US11074272B1 (en) 2017-12-21 2021-07-27 Seeq Corporation System and method for managing streaming calculations
CN108133043B (en) * 2018-01-12 2022-07-29 福建星瑞格软件有限公司 Structured storage method for server running logs based on big data
KR102023504B1 (en) * 2018-01-24 2019-09-20 연세대학교 산학협력단 Method and Apparatus for Detecting Pseudo Periodic Patterns of Real Time Data Streams
SG11202007063PA (en) * 2018-01-26 2020-08-28 Ge Inspection Technologies Lp Real time multi variate time series search
US10726079B2 (en) 2018-01-30 2020-07-28 Splunk Inc. Metric forecasting interface with alert prediction
US10997180B2 (en) 2018-01-31 2021-05-04 Splunk Inc. Dynamic query processor for streaming and batch queries
US10776194B2 (en) 2018-01-31 2020-09-15 Splunk Inc. Self-monitor for computing devices of a distributed computing system
CN108280015B (en) * 2018-02-07 2022-10-14 福建星瑞格软件有限公司 Cluster server log real-time processing method based on big data and computer equipment
KR102079439B1 (en) * 2018-02-22 2020-02-19 주식회사 에이티센스 Ecg measurement system having patch-type ecg measurement apparatus
US11954713B2 (en) 2018-03-13 2024-04-09 Johnson Controls Tyco IP Holdings LLP Variable refrigerant flow system with electricity consumption apportionment
US11244224B2 (en) * 2018-03-20 2022-02-08 International Business Machines Corporation Comparing time series data using context-based similarity
US10885049B2 (en) 2018-03-26 2021-01-05 Splunk Inc. User interface to identify one or more pivot identifiers and one or more step identifiers to process events
US10909128B2 (en) 2018-03-26 2021-02-02 Splunk Inc. Analyzing journey instances that include an ordering of step instances including a subset of a set of events
US10909182B2 (en) 2018-03-26 2021-02-02 Splunk Inc. Journey instance generation based on one or more pivot identifiers and one or more step identifiers
US10754822B1 (en) 2018-04-18 2020-08-25 Palantir Technologies Inc. Systems and methods for ontology migration
US11573955B1 (en) 2018-04-30 2023-02-07 Splunk Inc. Data-determinant query terms
US11238049B1 (en) 2018-04-30 2022-02-01 Splunk Inc. Revising catalog metadata based on parsing queries
US11157497B1 (en) 2018-04-30 2021-10-26 Splunk Inc. Dynamically assigning a search head and search nodes for a query
US11392578B1 (en) 2018-04-30 2022-07-19 Splunk Inc. Automatically generating metadata for a metadata catalog based on detected changes to the metadata catalog
US11327992B1 (en) 2018-04-30 2022-05-10 Splunk Inc. Authenticating a user to access a data intake and query system
US11334543B1 (en) 2018-04-30 2022-05-17 Splunk Inc. Scalable bucket merging for a data intake and query system
US11892996B1 (en) 2019-07-16 2024-02-06 Splunk Inc. Identifying an indexing node to process data using a resource catalog
US11275733B1 (en) 2018-04-30 2022-03-15 Splunk Inc. Mapping search nodes to a search head using a tenant identifier
GB201807534D0 (en) 2018-05-09 2018-06-20 Palantir Technologies Inc Systems and methods for indexing and searching
US11238012B1 (en) 2018-05-15 2022-02-01 Splunk Inc. Log data extraction from data chunks of an isolated execution environment
US11113301B1 (en) 2018-05-15 2021-09-07 Splunk Inc. Generating metadata for events based on parsed location information of data chunks of an isolated execution environment
CN110580241B (en) 2018-05-22 2023-09-01 微软技术许可有限责任公司 Preheating index files
ES2938488T3 (en) * 2018-06-06 2023-04-11 Siemens Ag Method and computing device for performing an interval search on numerical time series data
EP3579072A1 (en) * 2018-06-06 2019-12-11 Siemens Aktiengesellschaft Method for automatically generating labelled signatures
KR102232003B1 (en) * 2018-06-20 2021-03-25 주식회사 파수 Method for object management using trace identifier, apparatus for the same, computer program for the same, and recording medium storing computer program thereof
US11216432B2 (en) 2018-07-06 2022-01-04 Cfph, Llc Index data structures and graphical user interface
US11271939B2 (en) 2018-07-31 2022-03-08 Splunk Inc. Facilitating detection of suspicious access to resources
US10949420B2 (en) 2018-07-31 2021-03-16 Splunk Inc. Identifying similar field sets using related source types
US11100172B2 (en) 2018-07-31 2021-08-24 Splunk Inc. Providing similar field sets based on related source types
US10692299B2 (en) 2018-07-31 2020-06-23 Splunk Inc. Precise manipulation of virtual object position in an extended reality environment
US10833942B2 (en) 2018-07-31 2020-11-10 Splunk Inc. Behavioral based device clustering system and method
US11681944B2 (en) 2018-08-09 2023-06-20 Oracle International Corporation System and method to generate a labeled dataset for training an entity detection system
TWI682286B (en) * 2018-08-31 2020-01-11 愛酷智能科技股份有限公司 System for document searching using results of text analysis and natural language input
US11062042B1 (en) 2018-09-26 2021-07-13 Splunk Inc. Authenticating data associated with a data intake and query system using a distributed ledger system
US11106681B2 (en) 2018-09-28 2021-08-31 Splunk Inc. Conditional processing based on inferred sourcetypes
US11144185B1 (en) 2018-09-28 2021-10-12 Splunk Inc. Generating and providing concurrent journey visualizations associated with different journey definitions
US11537627B1 (en) 2018-09-28 2022-12-27 Splunk Inc. Information technology networked cloud service monitoring
US11762869B1 (en) 2018-09-28 2023-09-19 Splunk Inc. Generating journey flow visualization with node placement based on shortest distance to journey start
US11017764B1 (en) 2018-09-28 2021-05-25 Splunk Inc. Predicting follow-on requests to a natural language request received by a natural language processing system
US11074196B1 (en) 2018-09-28 2021-07-27 Splunk Inc. Evicting data associated with a data intake and query system from a local storage
US11226964B1 (en) 2018-09-28 2022-01-18 Splunk Inc. Automated generation of metrics from log data
US11366842B1 (en) 2018-09-28 2022-06-21 Splunk Inc. IT service monitoring by ingested machine data with KPI prediction and impactor determination
US10942774B1 (en) 2018-09-28 2021-03-09 Splunk Inc. Dynamic reassignment of search processes into workload pools in a search and indexing system
US10776441B1 (en) 2018-10-01 2020-09-15 Splunk Inc. Visual programming for iterative publish-subscribe message processing system
US10929415B1 (en) 2018-10-01 2021-02-23 Splunk Inc. Isolated execution environment system monitoring
US10761813B1 (en) 2018-10-01 2020-09-01 Splunk Inc. Assisted visual programming for iterative publish-subscribe message processing system
US10775976B1 (en) 2018-10-01 2020-09-15 Splunk Inc. Visual previews for programming an iterative publish-subscribe message processing system
US11775501B2 (en) 2018-10-26 2023-10-03 Splunk Inc. Trace and span sampling and analysis for instrumented software
US11016648B2 (en) 2018-10-30 2021-05-25 Johnson Controls Technology Company Systems and methods for entity visualization and management with an entity node editor
US10936585B1 (en) 2018-10-31 2021-03-02 Splunk Inc. Unified data processing across streaming and indexed data sets
JP7565267B2 (en) 2018-11-02 2024-10-10 シー3.エーアイ, インコーポレイテッド Systems and methods for full history dynamic network analysis
US11927925B2 (en) 2018-11-19 2024-03-12 Johnson Controls Tyco IP Holdings LLP Building system with a time correlated reliability data stream
US10394555B1 (en) * 2018-12-17 2019-08-27 Bakhtgerey Sinchev Computing network architecture for reducing a computing operation time and memory usage associated with determining, from a set of data elements, a subset of at least two data elements, associated with a target computing operation result
US11093364B2 (en) * 2018-12-18 2021-08-17 Micron Technology, Inc. Management of event log information of a memory sub-system
US11436567B2 (en) 2019-01-18 2022-09-06 Johnson Controls Tyco IP Holdings LLP Conference room management system
GB2595395A (en) * 2019-01-21 2021-11-24 Victory Software Ltd An online communication system
US10788798B2 (en) 2019-01-28 2020-09-29 Johnson Controls Technology Company Building management system with hybrid edge-cloud processing
US20200244524A1 (en) * 2019-01-29 2020-07-30 Hewlett Packard Enterprise Development Lp Network device monitors
US11816321B1 (en) 2019-01-31 2023-11-14 Splunk Inc. Enhancing extraction rules based on user feedback
US10997192B2 (en) 2019-01-31 2021-05-04 Splunk Inc. Data source correlation user interface
US11449293B1 (en) 2019-01-31 2022-09-20 Splunk Inc. Interface for data visualizations on a wearable device
US10963347B1 (en) 2019-01-31 2021-03-30 Splunk Inc. Data snapshots for configurable screen on a wearable device
US11644940B1 (en) 2019-01-31 2023-05-09 Splunk Inc. Data visualization in an extended reality environment
US10891792B1 (en) 2019-01-31 2021-01-12 Splunk Inc. Precise plane detection and placement of virtual objects in an augmented reality environment
US11086890B1 (en) 2019-01-31 2021-08-10 Splunk Inc. Extraction rule validation
US10938634B1 (en) 2019-01-31 2021-03-02 Splunk Inc. System and method for centralized analytics for edge devices with enrichment pushdown
US10846062B1 (en) 2019-01-31 2020-11-24 Splunk Inc. Multi-prompt blocks for a visual playbook editor
US11321392B2 (en) * 2019-02-19 2022-05-03 International Business Machines Corporation Light weight index for querying low-frequency data in a big data environment
US10936643B1 (en) 2019-02-22 2021-03-02 Splunk Inc. User interface with automated condensation of machine data event streams
US11216466B2 (en) 2019-04-24 2022-01-04 Ebay Inc. Managing database offsets with time series
WO2020220216A1 (en) 2019-04-29 2020-11-05 Splunk Inc. Search time estimate in data intake and query system
US10754638B1 (en) 2019-04-29 2020-08-25 Splunk Inc. Enabling agile functionality updates using multi-component application
US11194564B1 (en) 2019-04-29 2021-12-07 Splunk Inc. Maintaining compatibility in a multi-component application
US11416285B1 (en) 2019-04-30 2022-08-16 Splunk Inc. Efficient and secure scalable-two-stage data collection
US10853082B1 (en) 2019-04-30 2020-12-01 Splunk Inc. Pipeline set selection based on duty cycle estimation of pipeline threads
CN111859028B (en) * 2019-04-30 2024-08-16 伊姆西Ip控股有限责任公司 Method, apparatus and computer program product for creating an index for streaming storage
US11715051B1 (en) 2019-04-30 2023-08-01 Splunk Inc. Service provider instance recommendations using machine-learned classifications and reconciliation
US10922892B1 (en) 2019-04-30 2021-02-16 Splunk Inc. Manipulation of virtual object position within a plane of an extended reality environment
US11238044B2 (en) 2019-05-06 2022-02-01 International Business Machines Corporation Candidate data record prioritization for match processing
US11507562B1 (en) 2019-05-22 2022-11-22 Splunk Inc. Associating data from different nodes of a distributed ledger system
US11269859B1 (en) 2019-05-22 2022-03-08 Splunk Inc. Correlating different types of data of a distributed ledger system
US11170029B2 (en) * 2019-05-31 2021-11-09 Lendingclub Corporation Multi-user cross-device tracking
US11250018B2 (en) 2019-06-25 2022-02-15 Periscope Data Inc. Method for automated query language expansion and indexing
US11797615B2 (en) 2019-07-02 2023-10-24 International Business Machines Corporation Random sampling from a search engine
CN110427368B (en) * 2019-07-12 2022-07-12 深圳绿米联创科技有限公司 Data processing method and device, electronic equipment and storage medium
US11636128B1 (en) 2019-07-16 2023-04-25 Splunk Inc. Displaying query results from a previous query when accessing a panel
US11113294B1 (en) 2019-07-16 2021-09-07 Splunk Inc. Recommending query templates during query formation
US11386158B1 (en) 2019-07-16 2022-07-12 Splunk Inc. Recommending query parameters based on tenant information
US11216511B1 (en) 2019-07-16 2022-01-04 Splunk Inc. Executing a child query based on results of a parent query
US11416465B1 (en) 2019-07-16 2022-08-16 Splunk Inc. Processing data associated with different tenant identifiers
US11604799B1 (en) 2019-07-16 2023-03-14 Splunk Inc. Performing panel-related actions based on user interaction with a graphical user interface
US11238048B1 (en) 2019-07-16 2022-02-01 Splunk Inc. Guided creation interface for streaming data processing pipelines
US11263268B1 (en) 2019-07-16 2022-03-01 Splunk Inc. Recommending query parameters based on the results of automatically generated queries
US11644955B1 (en) 2019-07-16 2023-05-09 Splunk Inc. Assigning a global parameter to queries in a graphical user interface
US11269871B1 (en) 2019-07-16 2022-03-08 Splunk Inc. Displaying multiple editable queries in a graphical user interface
CN112307360B (en) * 2019-07-30 2023-08-25 百度在线网络技术(北京)有限公司 Regional event detection method and device based on search engine and search engine
US11182576B1 (en) 2019-07-31 2021-11-23 Splunk Inc. Techniques for using tag placement to determine 3D object orientation
US11397715B2 (en) 2019-07-31 2022-07-26 International Business Machines Corporation Defining indexing fields for matching data entities
US11269872B1 (en) 2019-07-31 2022-03-08 Splunk Inc. Intent-based natural language processing system
US11283693B2 (en) * 2019-08-12 2022-03-22 Microsoft Technology Licensing, Llc Summarized event data responsive to a query
US20210056127A1 (en) * 2019-08-21 2021-02-25 Nec Laboratories America, Inc. Method for multi-modal retrieval and clustering using deep cca and active pairwise queries
USD949234S1 (en) 2019-09-18 2022-04-19 Splunk Inc. Typeface
USD968505S1 (en) 2019-09-18 2022-11-01 Splunk Inc. Typeface
US11494380B2 (en) 2019-10-18 2022-11-08 Splunk Inc. Management of distributed computing framework components in a data fabric service system
US10985970B1 (en) 2019-10-18 2021-04-20 Splunk Inc. Automatic actionable event responder for operational clusters
US11676345B1 (en) 2019-10-18 2023-06-13 Splunk Inc. Automated adaptive workflows in an extended reality environment
US11582036B1 (en) 2019-10-18 2023-02-14 Splunk Inc. Scaled authentication of endpoint devices
US11151125B1 (en) 2019-10-18 2021-10-19 Splunk Inc. Efficient updating of journey instances detected within unstructured event data
US11036456B1 (en) 2019-10-18 2021-06-15 Splunk Inc. Control of a display device included in a display grid
US11620157B2 (en) 2019-10-18 2023-04-04 Splunk Inc. Data ingestion pipeline anomaly detection
US11263229B1 (en) 2019-10-18 2022-03-01 Splunk Inc. Efficient detection of alert states within unstructured event data based on evaluation of structured data set
US11275944B1 (en) 2019-10-18 2022-03-15 Splunk Inc. External asset database management in an extended reality environment
US11217023B1 (en) 2019-10-18 2022-01-04 Splunk Inc. Generating three-dimensional data visualizations in an extended reality environment
US11599549B2 (en) 2019-10-18 2023-03-07 Splunk Inc. Sampling-based preview mode for a data intake and query system
US11579764B1 (en) 2019-10-21 2023-02-14 Splunk Inc. Interfaces for data monitoring and event response
CN110764706B (en) * 2019-10-25 2023-10-03 普联技术有限公司 Storage system, data management method, and storage medium
CN115210700A (en) 2019-12-31 2022-10-18 江森自控泰科知识产权控股有限责任合伙公司 Building data platform
US20210200174A1 (en) 2019-12-31 2021-07-01 Johnson Controls Technology Company Building information model management system with hierarchy generation
US11769066B2 (en) 2021-11-17 2023-09-26 Johnson Controls Tyco IP Holdings LLP Building data platform with digital twin triggers and actions
US11894944B2 (en) 2019-12-31 2024-02-06 Johnson Controls Tyco IP Holdings LLP Building data platform with an enrichment loop
US12021650B2 (en) 2019-12-31 2024-06-25 Tyco Fire & Security Gmbh Building data platform with event subscriptions
US11302083B1 (en) 2020-01-29 2022-04-12 Splunk Inc. Web-based three-dimensional extended reality workspace editor
US11922222B1 (en) 2020-01-30 2024-03-05 Splunk Inc. Generating a modified component for a data intake and query system using an isolated execution environment image
US11089107B1 (en) 2020-01-31 2021-08-10 Splunk Inc. Management of connected sensor devices
US12081418B2 (en) 2020-01-31 2024-09-03 Splunk Inc. Sensor data device
US11403157B1 (en) 2020-01-31 2022-08-02 Splunk Inc. Identifying a root cause of an error
US11436116B1 (en) 2020-01-31 2022-09-06 Splunk Inc. Recovering pre-indexed data from a shared storage system following a failed indexer
US11829415B1 (en) 2020-01-31 2023-11-28 Splunk Inc. Mapping buckets and search peers to a bucket map identifier for searching
US11055925B1 (en) 2020-01-31 2021-07-06 Splunk Inc. Techniques for placement of extended reality objects relative to physical objects in an extended reality environment
US11276236B1 (en) 2020-01-31 2022-03-15 Splunk Inc. Techniques for placement of extended reality objects relative to other extended reality objects in an extended reality environment
US12100280B2 (en) 2020-02-04 2024-09-24 Tyco Fire & Security Gmbh Systems and methods for software defined fire detection and risk assessment
US11403324B2 (en) * 2020-02-15 2022-08-02 Near Intelligence Holdings, Inc. Method for real-time cohort creation based on entity attributes derived from partially observable location data
US11762858B2 (en) 2020-03-19 2023-09-19 The Mitre Corporation Systems and methods for analyzing distributed system data streams using declarative specification, detection, and evaluation of happened-before relationships
US11537386B2 (en) 2020-04-06 2022-12-27 Johnson Controls Tyco IP Holdings LLP Building system with dynamic configuration of network resources for 5G networks
US11574461B2 (en) 2020-04-07 2023-02-07 Nec Corporation Time-series based analytics using video streams
US11853330B1 (en) 2020-04-30 2023-12-26 Splunk Inc. Data structure navigator
US11614923B2 (en) 2020-04-30 2023-03-28 Splunk Inc. Dual textual/graphical programming interfaces for streaming data processing pipelines
US11153325B1 (en) 2020-04-30 2021-10-19 Splunk Inc. Server-based restricted access storage
US11809447B1 (en) 2020-04-30 2023-11-07 Splunk Inc. Collapsing nodes within a journey model
US11269876B1 (en) * 2020-04-30 2022-03-08 Splunk Inc. Supporting graph data structure transformations in graphs generated from a query to event data
CN111291235A (en) * 2020-05-13 2020-06-16 成都四方伟业软件股份有限公司 Metadata storage method and device based on time sequence database
WO2021232292A1 (en) * 2020-05-20 2021-11-25 深圳市欢太科技有限公司 Log data processing method and related product
US11874809B2 (en) 2020-06-08 2024-01-16 Johnson Controls Tyco IP Holdings LLP Building system with naming schema encoding entity type and entity relationships
CN111782872B (en) * 2020-06-30 2023-08-08 北京百度网讯科技有限公司 Query method, query device, electronic equipment and storage medium
US11741131B1 (en) 2020-07-31 2023-08-29 Splunk Inc. Fragmented upload and re-stitching of journey instances detected within event data
US11847133B1 (en) 2020-07-31 2023-12-19 Splunk Inc. Real-time collaborative data visualization and interaction
US11615082B1 (en) 2020-07-31 2023-03-28 Splunk Inc. Using a data store and message queue to ingest data for a data intake and query system
US11704490B2 (en) 2020-07-31 2023-07-18 Splunk Inc. Log sourcetype inference model training for a data intake and query system
US11449371B1 (en) 2020-07-31 2022-09-20 Splunk Inc. Indexing data at a data intake and query system based on a node capacity threshold
US11487513B1 (en) 2020-07-31 2022-11-01 Splunk Inc. Reusable custom functions for playbooks
US11762442B1 (en) 2020-07-31 2023-09-19 Splunk Inc. Real-time machine learning at an edge of a distributed network
US11663176B2 (en) 2020-07-31 2023-05-30 Splunk Inc. Data field extraction model training for a data intake and query system
US12079304B1 (en) 2020-08-11 2024-09-03 Splunk Inc. Online data forecasting
US11729074B1 (en) 2020-08-11 2023-08-15 Splunk Inc. Online data decomposition
US12079233B1 (en) 2020-08-11 2024-09-03 Splunk Inc. Multiple seasonality online data decomposition
US20220067109A1 (en) * 2020-08-28 2022-03-03 Aera Technology, Inc. Cognitive automation platform
KR20220031256A (en) 2020-09-04 2022-03-11 주식회사 모카앤제이에스 Apparatus and method for controlling document transmission
US11620164B1 (en) 2020-09-25 2023-04-04 Splunk Inc. Virtual partitioning of a shared message bus
US11397773B2 (en) 2020-09-30 2022-07-26 Johnson Controls Tyco IP Holdings LLP Building management system with semantic model integration
US11954154B2 (en) 2020-09-30 2024-04-09 Johnson Controls Tyco IP Holdings LLP Building management system with semantic model integration
US11392605B1 (en) 2020-09-30 2022-07-19 Splunk Inc. Integration in computer analytics system
US11321217B1 (en) 2020-10-06 2022-05-03 Splunk Inc. Generating metrics values at component levels of a monolithic application and of a microservice of a microservices-based architecture
US11347622B1 (en) 2020-10-06 2022-05-31 Splunk Inc. Generating metrics values for teams of microservices of a microservices-based architecture
US11620303B1 (en) 2020-10-09 2023-04-04 Splunk Inc. Security essentials and information technology essentials for a data intake and query system
US11909750B1 (en) 2020-10-15 2024-02-20 Splunk Inc. Data reduction and evaluation via link analysis
US11551421B1 (en) 2020-10-16 2023-01-10 Splunk Inc. Mesh updates via mesh frustum cutting
US11144336B1 (en) 2020-10-16 2021-10-12 Splunk Inc. Customization tool for dashboards
US11250069B1 (en) 2020-10-16 2022-02-15 Splunk Inc. Related content identification for different types of machine-generated data
US11727643B1 (en) 2020-10-16 2023-08-15 Splunk Inc. Multi-environment networked remote collaboration system
US11563813B1 (en) 2020-10-16 2023-01-24 Splunk Inc. Presentation of collaboration environments for a networked remote collaboration session
US11669551B2 (en) 2020-10-16 2023-06-06 Splunk Inc. Rule-based data stream processing
US11544343B1 (en) 2020-10-16 2023-01-03 Splunk Inc. Codeless anchor generation for detectable features in an environment
US11816801B1 (en) 2020-10-16 2023-11-14 Splunk Inc. Codeless anchor generation for three-dimensional object models
US11544904B1 (en) 2020-10-16 2023-01-03 Splunk Inc. Mesh updates in an extended reality environment
US11798235B1 (en) 2020-10-16 2023-10-24 Splunk Inc. Interactions in networked remote collaboration environments
US11546437B1 (en) 2020-10-16 2023-01-03 Splunk Inc. Playback of a stored networked remote collaboration session
US11127223B1 (en) 2020-10-16 2021-09-21 Splunkinc. Mesh updates via mesh splitting
US11843622B1 (en) 2020-10-16 2023-12-12 Splunk Inc. Providing machine learning models for classifying domain names for malware detection
US11609913B1 (en) 2020-10-16 2023-03-21 Splunk Inc. Reassigning data groups from backup to searching for a processing node
US11776218B1 (en) 2020-10-16 2023-10-03 Splunk Inc. Networked remote collaboration system
US11388211B1 (en) * 2020-10-16 2022-07-12 Splunk Inc. Filter generation for real-time data stream
US11347625B1 (en) 2020-10-19 2022-05-31 Splunk Inc. Rendering a service graph illustrate page provider dependencies at query time using exemplars
US11947988B1 (en) 2020-10-19 2024-04-02 Splunk Inc. Load balancer bypass for direct ingestion of data into a data intake and query system
US11567735B1 (en) 2020-10-19 2023-01-31 Splunk Inc. Systems and methods for integration of multiple programming languages within a pipelined search query
US11789950B1 (en) 2020-10-19 2023-10-17 Splunk Inc. Dynamic storage and deferred analysis of data stream events
US12079175B2 (en) 2020-10-19 2024-09-03 Splunk Inc. Streaming synthesis of distributed traces from machine logs
US11500871B1 (en) 2020-10-19 2022-11-15 Splunk Inc. Systems and methods for decoupling search processing language and machine learning analytics from storage of accessed data
US11727007B1 (en) 2020-10-19 2023-08-15 Splunk Inc. Systems and methods for a unified analytics platform
US11693850B1 (en) 2020-10-19 2023-07-04 Splunk Inc. Rendering a service graph to illustrate page provider dependencies at an aggregate level
US11748634B1 (en) 2020-10-19 2023-09-05 Splunk Inc. Systems and methods for integration of machine learning components within a pipelined search query to generate a graphic visualization
US11704313B1 (en) 2020-10-19 2023-07-18 Splunk Inc. Parallel branch operation using intermediary nodes
US11861767B1 (en) 2020-10-19 2024-01-02 Splunk Inc. Streaming data visualizations
CN114490518A (en) * 2020-10-23 2022-05-13 伊姆西Ip控股有限责任公司 Method, apparatus and program product for managing indexes of a streaming data storage system
US11704285B1 (en) 2020-10-29 2023-07-18 Splunk Inc. Metrics and log integration
US11675771B1 (en) 2020-10-29 2023-06-13 Splunk Inc. Identity resolution
US11552974B1 (en) 2020-10-30 2023-01-10 Splunk Inc. Cybersecurity risk analysis and mitigation
US11860858B1 (en) 2020-10-30 2024-01-02 Splunk Inc. Decoding distributed ledger transaction records
US11516069B1 (en) 2020-10-30 2022-11-29 Splunk Inc. Aggregate notable events in an information technology and security operations application
US11522812B1 (en) 2020-10-30 2022-12-06 Splunk Inc. Workflows for configuring the ingestion of user data from a service provider network into a data intake and query system
US12058212B2 (en) 2020-10-30 2024-08-06 Tyco Fire & Security Gmbh Building management system with auto-configuration using existing points
US12061453B2 (en) 2020-12-18 2024-08-13 Tyco Fire & Security Gmbh Building management system performance index
US11797501B2 (en) * 2020-12-21 2023-10-24 Vmware, Inc. Methods and systems for aggregating and querying log messages
US11516419B2 (en) 2021-01-22 2022-11-29 Omnivision Technologies, Inc. Digital time stamping design for event driven pixel
US11892976B2 (en) 2021-01-29 2024-02-06 Splunk Inc. Enhanced search performance using data model summaries stored in a remote data store
US11892988B1 (en) 2021-01-29 2024-02-06 Splunk Inc. Content pack management
US11714683B1 (en) 2021-01-29 2023-08-01 Splunk Inc. Information technology and security application automation architecture
US11687438B1 (en) 2021-01-29 2023-06-27 Splunk Inc. Adaptive thresholding of data streamed to a data processing pipeline
US11836146B1 (en) 2021-01-29 2023-12-05 Splunk Inc. Storing indexed fields per source type as metadata at the bucket level to facilitate search-time field learning
US11675816B1 (en) 2021-01-29 2023-06-13 Splunk Inc. Grouping evens into episodes using a streaming data processor
US11676072B1 (en) 2021-01-29 2023-06-13 Splunk Inc. Interface for incorporating user feedback into training of clustering model
US11402979B1 (en) 2021-01-29 2022-08-02 Splunk Inc. Interactive expandable histogram timeline module for security flagged events
US11782920B1 (en) 2021-01-29 2023-10-10 Splunk Inc. Durable search queries for reliable distributed data retrieval
US12099428B1 (en) 2021-01-29 2024-09-24 Splunk Inc. Multiple modalities of data collection and analysis for real-time monitoring (RUM) in a microservices-based architecture
US11695803B2 (en) 2021-01-29 2023-07-04 Splunk Inc. Extension framework for an information technology and security operations application
US11494381B1 (en) 2021-01-29 2022-11-08 Splunk Inc. Ingestion and processing of both cloud-based and non-cloud-based data by a data intake and query system
US11841827B2 (en) 2021-01-29 2023-12-12 Splunk Inc. Facilitating generation of data model summaries
US11636116B2 (en) 2021-01-29 2023-04-25 Splunk Inc. User interface for customizing data streams
US12038926B1 (en) 2021-01-29 2024-07-16 Splunk Inc. Intelligent search-time determination and usage of fields extracted at index-time
CN112818039B (en) * 2021-02-04 2022-04-26 浙江邦盛科技有限公司 Precise window processing method for solving out-of-order events and late arrival data
US20220253470A1 (en) * 2021-02-05 2022-08-11 SparkCognition, Inc. Model-based document search
US11734886B1 (en) 2021-02-18 2023-08-22 Splunk Inc. Interaction tools in networked remote collaboration
US12086920B1 (en) 2021-02-18 2024-09-10 Splunk Inc. Submesh-based updates in an extended reality environment
US12112435B1 (en) 2021-02-18 2024-10-08 Splunk Inc. Collaboration spaces in extended reality conference sessions
US11915377B1 (en) 2021-02-18 2024-02-27 Splunk Inc. Collaboration spaces in networked remote collaboration sessions
US12106419B1 (en) 2021-02-18 2024-10-01 Splunk Inc. Live updates in a networked remote collaboration session
US11687487B1 (en) 2021-03-11 2023-06-27 Splunk Inc. Text files updates to an active processing pipeline
CN117280291A (en) 2021-03-17 2023-12-22 江森自控泰科知识产权控股有限责任合伙公司 System and method for determining device energy waste
US11558412B1 (en) 2021-03-29 2023-01-17 Splunk Inc. Interactive security visualization of network entity data
US11507557B2 (en) * 2021-04-02 2022-11-22 International Business Machines Corporation Dynamic sampling of streaming data using finite memory
CN112766426B (en) * 2021-04-06 2021-09-07 中国铁道科学研究院集团有限公司通信信号研究所 Fault type identification method based on time constraint
US11949547B2 (en) 2021-04-08 2024-04-02 Splunk Inc. Enhanced simple network management protocol (SNMP) connector
US11716405B1 (en) 2021-04-14 2023-08-01 Splunk Inc. System and method for identifying cache miss in backend application
US11582316B1 (en) 2021-04-15 2023-02-14 Splunk Inc. URL normalization for rendering a service graph and aggregating metrics associated with a real user session
US11727016B1 (en) 2021-04-15 2023-08-15 Splunk Inc. Surfacing and displaying exemplary spans from a real user session in response to a query
CN113051234A (en) * 2021-04-19 2021-06-29 国际关系学院 Mobile on-site big data analysis platform
US11663219B1 (en) 2021-04-23 2023-05-30 Splunk Inc. Determining a set of parameter values for a processing pipeline
US11755344B1 (en) 2021-04-27 2023-09-12 Splunk Inc. Dynamic drill-down of nested graphical interfaces
US11663109B1 (en) 2021-04-30 2023-05-30 Splunk Inc. Automated seasonal frequency identification
US12020077B1 (en) 2021-04-30 2024-06-25 Splunk Inc. Autoscaling in a stateful system
US11714823B1 (en) 2021-04-30 2023-08-01 Splunk Inc. Generating metric data from log data using metricization rules
US11671457B2 (en) 2021-04-30 2023-06-06 Splunk Inc. On-premises action execution agent for cloud-based information technology and security operations applications
US11604789B1 (en) 2021-04-30 2023-03-14 Splunk Inc. Bi-directional query updates in a user interface
US11675473B1 (en) 2021-04-30 2023-06-13 Splunk Inc. User interface for summarizing data to generate previews of metric data
US11526504B1 (en) 2021-04-30 2022-12-13 Splunk Inc. Search-time field extraction in a data intake and query system
US11736452B1 (en) 2021-04-30 2023-08-22 Splunk Inc. Secure update of dashboard properties
US11720591B1 (en) 2021-04-30 2023-08-08 Splunk Inc. Virtual metrics
US11934256B1 (en) 2021-06-01 2024-03-19 Splunk Inc. Determining ingestion latency of data intake and query system instances
US11915044B2 (en) 2021-06-09 2024-02-27 Splunk Inc. Distributed task assignment in a cluster computing system
US11899723B2 (en) 2021-06-22 2024-02-13 Johnson Controls Tyco IP Holdings LLP Building data platform with context based twin function processing
EP4152174A4 (en) * 2021-06-23 2023-11-29 Beijing Baidu Netcom Science Technology Co., Ltd. Data processing method and apparatus, and computing device and medium
US11714799B1 (en) 2021-06-30 2023-08-01 Splunk Inc. Automated testing of add-on configurations for searching event data using a late-binding schema
US11941421B1 (en) 2021-07-09 2024-03-26 Splunk Inc. Evaluating and scaling a collection of isolated execution environments at a particular geographic location
US11809395B1 (en) 2021-07-15 2023-11-07 Splunk Inc. Load balancing, failover, and reliable delivery of data in a data intake and query system
US11645252B2 (en) 2021-07-23 2023-05-09 Bank Of America Corporation System and method for efficiently validating time-series data using a hash-based representation of the data
US11640389B2 (en) 2021-07-23 2023-05-02 Bank Of America Corporation Hash-based identification of data corruption issues in time-series data
US11860760B1 (en) 2021-07-30 2024-01-02 Splunk Inc. Aggregating metrics for workflows associated with a real user session
US12072939B1 (en) 2021-07-30 2024-08-27 Splunk Inc. Federated data enrichment objects
US11989592B1 (en) 2021-07-30 2024-05-21 Splunk Inc. Workload coordinator for providing state credentials to processing tasks of a data processing pipeline
JP2023028252A (en) * 2021-08-19 2023-03-03 株式会社日立製作所 Data management device and program
WO2023056055A1 (en) * 2021-09-30 2023-04-06 The Johns Hopkins University Systems and methods for disambiguating time-series data generated by monitoring devices
US11704219B1 (en) 2021-10-04 2023-07-18 Splunk Inc. Performance monitoring of distributed ledger nodes
US11789804B1 (en) 2021-10-18 2023-10-17 Splunk Inc. Identifying the root cause of failure observed in connection to a workflow
US12013879B2 (en) 2021-10-18 2024-06-18 Splunk Inc. Dynamic resolution estimation in metric time series data
US12056169B1 (en) 2021-10-28 2024-08-06 Splunk Inc. Systems and methods for DNS text classification
US11641310B1 (en) 2021-10-29 2023-05-02 Splunk Inc. Entity lifecycle management in service monitoring system
US11886451B2 (en) 2021-10-29 2024-01-30 Splunk Inc. Quantization of data streams of instrumented software and handling of delayed data by adjustment of a maximum delay
US11886453B2 (en) 2021-10-29 2024-01-30 Splunk Inc. Quantization of data streams of instrumented software and handling of delayed or late data
US12061691B2 (en) 2021-10-29 2024-08-13 Splunk Inc. Graphical user interface for presentation of network security risk and threat information
US11831521B1 (en) 2021-10-29 2023-11-28 Splunk Inc. Entity lifecycle management in service monitoring system
US11796974B2 (en) 2021-11-16 2023-10-24 Johnson Controls Tyco IP Holdings LLP Building data platform with schema extensibility for properties and tags of a digital twin
US11934966B2 (en) 2021-11-17 2024-03-19 Johnson Controls Tyco IP Holdings LLP Building data platform with digital twin inferences
US11704311B2 (en) 2021-11-24 2023-07-18 Johnson Controls Tyco IP Holdings LLP Building data platform with a distributed digital twin
US11714930B2 (en) 2021-11-29 2023-08-01 Johnson Controls Tyco IP Holdings LLP Building data platform with digital twin based inferences and predictions for a graphical building model
US12013673B2 (en) 2021-11-29 2024-06-18 Tyco Fire & Security Gmbh Building control system using reinforcement learning
US11907227B1 (en) 2021-12-03 2024-02-20 Splunk Inc. System and method for changepoint detection in streaming data
US11546142B1 (en) 2021-12-22 2023-01-03 Bakhtgerey Sinchev Cryptography key generation method for encryption and decryption
CN114417817B (en) * 2021-12-30 2023-05-16 中国电信股份有限公司 Session information cutting method and device
US11947528B1 (en) 2022-01-06 2024-04-02 Splunk Inc. Automatic generation of queries using non-textual input
US11936545B1 (en) 2022-01-11 2024-03-19 Splunk Inc. Systems and methods for detecting beaconing communications in aggregated traffic data
US12088611B1 (en) 2022-01-11 2024-09-10 Splunk Inc. Systems and methods for training a machine learning model to detect beaconing communications
US12050507B1 (en) 2022-01-24 2024-07-30 Splunk Inc. System and method for data ingestion, anomaly detection and notification
US11714698B1 (en) 2022-01-28 2023-08-01 Splunk Inc. System and method for machine-learning based alert prioritization
US11954541B1 (en) 2022-01-28 2024-04-09 Splunk Inc. Highly available message ingestion by a data intake and query system
US12079100B1 (en) 2022-01-31 2024-09-03 Splunk Inc. Systems and methods for machine-learning based alert grouping and providing remediation recommendations
US11902081B1 (en) 2022-01-31 2024-02-13 Splunk Inc. Managing collection agents via an agent controller
US12120124B1 (en) 2022-01-31 2024-10-15 Splunk Inc. Live app testing within an app editor for an information technology and security operations application
US20230280721A1 (en) * 2022-03-03 2023-09-07 Model-Prime, Inc. Distributed robotics log processing
US11630624B1 (en) 2022-03-14 2023-04-18 Ricoh Company, Ltd. Concurrent event detection for distributed rasterization processing in a printing environment
US11818087B1 (en) 2022-03-21 2023-11-14 Splunk Inc. User-to-user messaging-based software troubleshooting tool
US12072907B1 (en) 2022-03-29 2024-08-27 Splunk Inc. Graphical user interface for time-based data visualization
CN115037729B (en) * 2022-04-21 2024-05-28 中国建设银行股份有限公司 Data aggregation method, device, electronic equipment and computer readable medium
US11835989B1 (en) 2022-04-21 2023-12-05 Splunk Inc. FPGA search in a cloud compute node
US12028226B1 (en) 2022-04-26 2024-07-02 Splunk Inc. Content pack management in service monitoring system
US11829378B1 (en) 2022-04-29 2023-11-28 Splunk Inc. Automated generation of insights for machine generated data
US12086451B1 (en) 2022-04-29 2024-09-10 Splunk Inc. Automated downscaling of data stores
US12093272B1 (en) 2022-04-29 2024-09-17 Splunk Inc. Retrieving data identifiers from queue for search of external data system
US11983181B1 (en) 2022-06-10 2024-05-14 Splunk Inc. Data stream processing instruction set previews using batch conversion
US12008046B1 (en) 2022-06-10 2024-06-11 Splunk Inc. System and method for automated determination of search query parameters for anomaly detection
US11956133B1 (en) 2022-06-22 2024-04-09 Splunk Inc. Navigator performance optimization and signalflow query acceleration
WO2024015645A1 (en) * 2022-07-15 2024-01-18 Imply Data, Inc. Query-time data sessionization and analysis
US11966388B1 (en) 2022-07-29 2024-04-23 Splunk Inc. Updating and executing search queries of multiple independent panels of a dashboard
US12073103B1 (en) 2022-07-29 2024-08-27 Splunk Inc. Multiple storage system event handling
US11886845B1 (en) 2022-07-29 2024-01-30 Splunk, Inc. Computer dashboard editing tool
US12061533B1 (en) 2022-07-29 2024-08-13 Splunk Inc. Ingest health monitoring
US11915315B1 (en) * 2022-08-10 2024-02-27 Bgc Partners, L.P. Method, apparatus and system for time stamping and sequencing data items
US12061633B2 (en) 2022-09-08 2024-08-13 Tyco Fire & Security Gmbh Building system that maps points into a graph schema
US12013823B2 (en) 2022-09-08 2024-06-18 Tyco Fire & Security Gmbh Gateway system that maps points into a graph schema
US12072783B1 (en) 2022-10-25 2024-08-27 Splunk Inc. Performing iterative entity discovery and instrumentation
US11755453B1 (en) 2022-10-25 2023-09-12 Splunk Inc. Performing iterative entity discovery and instrumentation
US11895192B1 (en) 2022-10-26 2024-02-06 Splunk Inc. Managing subscriptions to resource updates made via a target interface
US12072859B1 (en) 2022-10-26 2024-08-27 Splunk Inc. Accessibility controls for manipulating data visualizations
US11838189B1 (en) 2022-10-28 2023-12-05 Splunk Inc. Creating a budget-based time series
US11824938B1 (en) 2022-10-28 2023-11-21 Splunk Inc. External sensor integration at edge device for delivery of data to intake system
US12111874B1 (en) 2022-12-28 2024-10-08 Splunk Inc. Exploratory data analysis system for automated generation of search queries using machine learning techniques to identify certain log fields and correlation thereof
US12072913B1 (en) 2023-01-31 2024-08-27 Splunk Inc. Unhandled data protection for programmatic input/output routing to datasets with user-defined partitions
US12038993B1 (en) 2023-01-31 2024-07-16 Splunk Inc. Techniques for showing matched URLs for a URL grouping rule
US11907097B1 (en) 2023-01-31 2024-02-20 Splunk Inc. Techniques for processing trace data
US11924284B1 (en) 2023-05-31 2024-03-05 Splunk Inc. Automated security, orchestration, automation, and response (SOAR) app generation based on application programming interface specification data
CN116521094B (en) * 2023-07-03 2023-11-14 之江实验室 Metadata storage method and device, computer equipment and storage medium

Citations (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5347540A (en) * 1992-07-08 1994-09-13 Tektronix, Inc. Dynamic storage allocation in a logic analyzer
US5613113A (en) * 1993-10-08 1997-03-18 International Business Machines Corporation Consistent recreation of events from activity logs
US5737600A (en) * 1994-09-12 1998-04-07 International Business Machines Corporation Method and system for log management in a coupled data processing system
US5745693A (en) * 1992-07-01 1998-04-28 Mci Corporation System for gathering and reporting real time data from an IDNX communications network
US5847972A (en) * 1993-09-24 1998-12-08 Eick; Stephen Gregory Method and apparatus for graphically analzying a log-file
US6115705A (en) * 1997-05-19 2000-09-05 Microsoft Corporation Relational database system and method for query processing using early aggregation
US6449618B1 (en) * 1999-03-25 2002-09-10 Lucent Technologies Inc. Real-time event processing system with subscription model
US20040143602A1 (en) * 2002-10-18 2004-07-22 Antonio Ruiz Apparatus, system and method for automated and adaptive digital image/video surveillance for events and configurations using a rich multimedia relational database
US20050015624A1 (en) * 2003-06-09 2005-01-20 Andrew Ginter Event monitoring and management
US20050033803A1 (en) * 2003-07-02 2005-02-10 Vleet Taylor N. Van Server architecture and methods for persistently storing and serving event data
US6920468B1 (en) * 1998-07-08 2005-07-19 Ncr Corporation Event occurrence detection method and apparatus
US6980963B1 (en) * 1999-11-05 2005-12-27 Ford Motor Company Online system and method of status inquiry and tracking related to orders for consumer product having specific configurations
US20060026164A1 (en) * 2004-03-31 2006-02-02 Jung Edward K Data storage for distributed sensor networks
US20060184529A1 (en) * 2005-02-16 2006-08-17 Gal Berg System and method for analysis and management of logs and events
US20060218278A1 (en) * 2005-03-24 2006-09-28 Fujitsu Limited Demand forecasting system for data center, demand forecasting method and recording medium with a demand forecasting program recorded thereon
US20060229931A1 (en) * 2005-04-07 2006-10-12 Ariel Fligler Device, system, and method of data monitoring, collection and analysis
US20060265406A1 (en) * 2005-05-20 2006-11-23 Microsoft Corporation Recognizing event patterns from event streams
US20070038889A1 (en) * 2005-08-11 2007-02-15 Wiggins Robert D Methods and systems to access process control log information associated with process control systems
US20070067575A1 (en) * 2005-09-20 2007-03-22 Morris John M Method of managing cache memory based on data temperature
US20070073743A1 (en) * 2004-02-13 2007-03-29 Memento Inc. Systems and methods for monitoring and detecting fraudulent uses of business applications
US20070073519A1 (en) * 2005-05-31 2007-03-29 Long Kurt J System and Method of Fraud and Misuse Detection Using Event Logs
US20070113031A1 (en) * 2005-11-16 2007-05-17 International Business Machines Corporation Memory management system and method for storing and retrieving messages
US20070124437A1 (en) * 2005-11-30 2007-05-31 Cisco Technology, Inc. Method and system for real-time collection of log data from distributed network components
US20070283194A1 (en) * 2005-11-12 2007-12-06 Phillip Villella Log collection, structuring and processing
US20080279113A1 (en) * 2005-03-24 2008-11-13 Janne Kalliola Information Gathering From Traffic Flow in a Communication Network
US20090172014A1 (en) * 2005-08-23 2009-07-02 Raymond John Huetter Stream-Oriented Database Machine and Method
US7620697B1 (en) * 2006-03-10 2009-11-17 Omniture, Inc. Online syndicated content feed metrics
US7673340B1 (en) * 2004-06-02 2010-03-02 Clickfox Llc System and method for analyzing system user behavior
US7827182B1 (en) * 2004-06-02 2010-11-02 Cisco Technology, Inc Searching for a path to identify where to move entries among hash tables with storage for multiple entries per bucket during insert operations
US8601112B1 (en) * 2006-03-14 2013-12-03 Amazon Technologies, Inc. Method and system for collecting and analyzing time-series data

Family Cites Families (398)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4739398A (en) * 1986-05-02 1988-04-19 Control Data Corporation Method, apparatus and system for recognizing broadcast segments
US4956774A (en) 1988-09-02 1990-09-11 International Business Machines Corporation Data base optimizer using most frequency values statistics
US5121443A (en) 1989-04-25 1992-06-09 Spectra-Physics, Inc. Neural net system for analyzing chromatographic peaks
WO1991019989A1 (en) 1990-06-21 1991-12-26 Reynolds Software, Inc. Method and apparatus for wave analysis and event recognition
US5414838A (en) * 1991-06-11 1995-05-09 Logical Information Machine System for extracting historical market information with condition and attributed windows
US5627886A (en) 1994-09-22 1997-05-06 Electronic Data Systems Corporation System and method for detecting fraudulent network usage patterns using real-time network monitoring
US5799268A (en) 1994-09-28 1998-08-25 Apple Computer, Inc. Method for extracting knowledge from online documentation and creating a glossary, index, help database or the like
US5953439A (en) 1994-11-04 1999-09-14 Ishihara; Ken Apparatus for and method of extracting time series image information
US6263290B1 (en) 1995-02-22 2001-07-17 Michael K. Williams Process and machine for signal waveform analysis
US5951541A (en) 1995-06-07 1999-09-14 Cardiogenesis Corporation Channel forming device with a secured distal extremity
US5745746A (en) * 1996-06-24 1998-04-28 International Business Machines Corporation Method for localizing execution of subqueries and determining collocation of execution of subqueries in a parallel database
WO1997016794A1 (en) 1995-11-02 1997-05-09 International Business Machines Corporation Storage plane organization and storage systems based thereon
US5862325A (en) 1996-02-29 1999-01-19 Intermind Corporation Computer-based communication system and method using metadata defining a control structure
US5761652A (en) * 1996-03-20 1998-06-02 International Business Machines Corporation Constructing balanced multidimensional range-based bitmap indices
US5751965A (en) 1996-03-21 1998-05-12 Cabletron System, Inc. Network connection status monitor and display
GB9606733D0 (en) 1996-03-29 1996-06-05 British Telecomm Collection of events within a distributed object system
JP2000507008A (en) * 1996-04-04 2000-06-06 フレア・テクノロジーズ・リミテッド Systems, software and methods for locating information in a collection of text-based information sources
FR2751448B1 (en) 1996-07-17 1999-01-15 Bull Sa METHOD FOR REAL-TIME MONITORING OF A COMPUTER SYSTEM FOR ITS ADMINISTRATION AND ASSISTANCE IN MAINTAINING IT IN OPERATION
US6760903B1 (en) 1996-08-27 2004-07-06 Compuware Corporation Coordinated application monitoring in a distributed computing environment
JP3427692B2 (en) * 1996-11-20 2003-07-22 松下電器産業株式会社 Character recognition method and character recognition device
US6816830B1 (en) 1997-07-04 2004-11-09 Xerox Corporation Finite state data structures with paths representing paired strings of tags and tag combinations
US5960434A (en) * 1997-09-26 1999-09-28 Silicon Graphics, Inc. System method and computer program product for dynamically sizing hash tables
US6363131B1 (en) 1998-04-29 2002-03-26 Hughes Electronics Corporation Method and apparatus for joint timing synchronization and frequency offset estimation
JP3213585B2 (en) * 1998-07-09 2001-10-02 株式会社インフォメックス Data search method and apparatus, data search system, recording medium
US6345283B1 (en) * 1998-07-20 2002-02-05 New Technologies Armor, Inc. Method and apparatus for forensic analysis of information stored in computer-readable media
US6356898B2 (en) 1998-08-31 2002-03-12 International Business Machines Corporation Method and system for summarizing topics of documents browsed by a user
US6285997B1 (en) 1998-11-16 2001-09-04 International Business Machines Corporation Query optimization with deferred update and autonomous sources
EP1129417A4 (en) * 1998-12-04 2004-06-30 Technology Enabling Company Ll Systems and methods for organizing data
AU2617699A (en) 1999-01-15 2000-08-01 Nokia Networks Oy Packet concatenation method and apparatus
US6516189B1 (en) * 1999-03-17 2003-02-04 Telephia, Inc. System and method for gathering data from wireless communications networks
US6751228B1 (en) 1999-03-23 2004-06-15 Yamaha Corporation Packet handler of audio data by isochronous mode
US6496831B1 (en) 1999-03-25 2002-12-17 Lucent Technologies Inc. Real-time event processing system for telecommunications and other applications
US6598078B1 (en) * 1999-04-29 2003-07-22 Aspect Communications Corporation Method and apparatus for generating a record from a time-marked information stream
US7523191B1 (en) 2000-06-02 2009-04-21 Yahoo! Inc. System and method for monitoring user interaction with web pages
US6801938B1 (en) 1999-06-18 2004-10-05 Torrent Systems, Inc. Segmentation and processing of continuous data streams using transactional semantics
CN100375088C (en) 1999-06-18 2008-03-12 国际商业机器公司 Segmentation and processing of continuous data streams using transactional semantics
US6636242B2 (en) * 1999-08-31 2003-10-21 Accenture Llp View configurer in a presentation services patterns environment
US6598087B1 (en) 1999-09-08 2003-07-22 Ge Capital Commercial Finance, Inc. Methods and apparatus for network-enabled virtual printing
US6612494B1 (en) * 1999-09-30 2003-09-02 Crossoff Incorporated Product authentication system
US6792458B1 (en) 1999-10-04 2004-09-14 Urchin Software Corporation System and method for monitoring and analyzing internet traffic
ATE344562T1 (en) * 1999-11-18 2006-11-15 Broadcom Corp TABLE LOOKUP MECHANISM FOR ADDRESS RESOLUTION IN A PACKET NETWORK SWITCH
US8635130B1 (en) 2000-02-14 2014-01-21 Td Ameritrade Ip Company, Inc. Method and system for analyzing and screening investment information
US20020029207A1 (en) 2000-02-28 2002-03-07 Hyperroll, Inc. Data aggregation server for managing a multi-dimensional database and database management system having data aggregation server integrated therein
US20030208472A1 (en) 2000-04-11 2003-11-06 Pham Peter Manh Method and apparatus for transparent keyword-based hyperlink
AU2001262958A1 (en) 2000-04-28 2001-11-12 Internet Security Systems, Inc. Method and system for managing computer security information
US7003781B1 (en) 2000-05-05 2006-02-21 Bristol Technology Inc. Method and apparatus for correlation of events in a distributed multi-system computing environment
US6505153B1 (en) 2000-05-22 2003-01-07 Compaq Information Technologies Group, L.P. Efficient method for producing off-line closed captions
US7003517B1 (en) * 2000-05-24 2006-02-21 Inetprofit, Inc. Web-based system and method for archiving and searching participant-based internet text sources for customer lead data
AU2001264928A1 (en) 2000-05-25 2001-12-03 Kanisa Inc. System and method for automatically classifying text
US6886020B1 (en) 2000-08-17 2005-04-26 Emc Corporation Method and apparatus for storage system metrics management and archive
US7576748B2 (en) 2000-11-28 2009-08-18 Nintendo Co. Ltd. Graphics system with embedded frame butter having reconfigurable pixel formats
US7146416B1 (en) 2000-09-01 2006-12-05 Yahoo! Inc. Web site activity monitoring system with tracking by categories and terms
US6993246B1 (en) * 2000-09-15 2006-01-31 Hewlett-Packard Development Company, L.P. Method and system for correlating data streams
WO2002027443A2 (en) 2000-09-25 2002-04-04 Itt Manufacturing Enterprises, Inc. Global computer network intrusion detection system
US20020046248A1 (en) 2000-10-13 2002-04-18 Honeywell International Inc. Email to database import utility
US6836890B1 (en) 2000-11-01 2004-12-28 Seebeyond Techonogy Corporation Methods and systems for message translation and parsing of data structures in a distributed component architecture
US6725287B1 (en) 2000-11-09 2004-04-20 Elity Systems, Inc. Method and system for capturing streaming data by an actionable information engine
US20020069223A1 (en) 2000-11-17 2002-06-06 Goodisman Aaron A. Methods and systems to link data
US6789046B1 (en) 2000-12-05 2004-09-07 Microsoft Corporation Performance logging solution
US6781599B2 (en) 2001-01-04 2004-08-24 At&T System and method for visualizing massive multi-digraphs
US6768994B1 (en) 2001-02-23 2004-07-27 Trimble Navigation Limited Web based data mining and location data reporting and system
US6907545B2 (en) 2001-03-02 2005-06-14 Pitney Bowes Inc. System and method for recognizing faults in machines
US20020169735A1 (en) 2001-03-07 2002-11-14 David Kil Automatic mapping from data to preprocessing algorithms
US7720958B2 (en) 2001-03-09 2010-05-18 International Business Machines Corporation Method and system for embedding correlated performance measurements for distributed application performance decomposition
US20030126613A1 (en) 2001-03-09 2003-07-03 Mcguire Todd J. System and method for visualizing user activity
US7194454B2 (en) 2001-03-12 2007-03-20 Lucent Technologies Method for organizing records of database search activity by topical relevance
US7600029B1 (en) 2001-03-15 2009-10-06 Governing Dynamics Investment, Llc System and method for generating, monitoring and reporting performance information relating to data transmissions
US7860706B2 (en) 2001-03-16 2010-12-28 Eli Abir Knowledge system method and appparatus
US6658367B2 (en) * 2001-03-28 2003-12-02 Hewlett-Packard Development Company, L.P. System for time-bucketing of baselined data collector data
US20020157017A1 (en) 2001-04-19 2002-10-24 Vigilance, Inc. Event monitoring, detection and notification system having security functions
MXPA03009815A (en) * 2001-04-26 2005-03-07 Newsgrade Corp Dynamic generation of personalized presentations of domain-specific information content.
AUPR464601A0 (en) * 2001-04-30 2001-05-24 Commonwealth Of Australia, The Shapes vector
US7191184B2 (en) * 2001-05-02 2007-03-13 National Instruments Corporation Optimized storage for measurement data
US6662176B2 (en) 2001-05-07 2003-12-09 Hewlett-Packard Development Company, L.P. Database indexing and rolling storage method for time-stamped normalized event data
US7197559B2 (en) 2001-05-09 2007-03-27 Mercury Interactive Corporation Transaction breakdown feature to facilitate analysis of end user performance of a server system
US6681184B2 (en) 2001-05-15 2004-01-20 Input/Output, Inc. System for estimating azimuthal variations in seismic data
US8019807B2 (en) 2001-05-23 2011-09-13 Wireless Enterprise Solutions Technology Limited Method and system for communication between computer systems
US7076547B1 (en) 2001-06-21 2006-07-11 Amdocs (Israel) Ltd. System and method for network performance and server application performance monitoring and for deriving exhaustive performance metrics
US7529659B2 (en) 2005-09-28 2009-05-05 Audible Magic Corporation Method and apparatus for identifying an unknown work
US7668718B2 (en) 2001-07-17 2010-02-23 Custom Speech Usa, Inc. Synchronized pattern recognition source data processed by manual or automatic means for creation of shared speaker-dependent speech user profile
US7134081B2 (en) 2001-08-14 2006-11-07 National Instruments Corporation Method and apparatus for controlling an instrumentation system
AU2002332556A1 (en) * 2001-08-15 2003-03-03 Visa International Service Association Method and system for delivering multiple services electronically to customers via a centralized portal architecture
US7278160B2 (en) 2001-08-16 2007-10-02 International Business Machines Corporation Presentation of correlated events as situation classes
US20030037034A1 (en) 2001-08-16 2003-02-20 Tim Daniels System and method for lubricants supply chain management
US20030084349A1 (en) 2001-10-12 2003-05-01 Oliver Friedrichs Early warning system for network attacks
US6763347B1 (en) * 2001-10-19 2004-07-13 Nick Zhang Indexing management for hierarchical main memory
US7107338B1 (en) 2001-12-05 2006-09-12 Revenue Science, Inc. Parsing navigation information to identify interactions based on the times of their occurrences
US6907422B1 (en) * 2001-12-18 2005-06-14 Siebel Systems, Inc. Method and system for access and display of data from large data sets
US20030126109A1 (en) 2002-01-02 2003-07-03 Tanya Couch Method and system for converting message data into relational table format
US6819117B2 (en) 2002-01-30 2004-11-16 Credence Systems Corporation PICA system timing measurement & calibration
CA2475319A1 (en) 2002-02-04 2003-08-14 Cataphora, Inc. A method and apparatus to visually present discussions for data mining purposes
US8095589B2 (en) 2002-03-07 2012-01-10 Compete, Inc. Clickstream analysis methods and systems
AU2003223379A1 (en) 2002-03-29 2003-10-13 Global Dataguard, Inc. Adaptive behavioral intrusion detection systems and methods
US20070038603A1 (en) * 2005-08-10 2007-02-15 Guha Ramanathan V Sharing context data across programmable search engines
JP2003308229A (en) 2002-04-17 2003-10-31 Nippon Telegr & Teleph Corp <Ntt> Time-series event record analysis method and apparatus, program and recording medium
DE60323086D1 (en) 2002-04-25 2008-10-02 Landmark Digital Services Llc ROBUST AND INVARIANT AUDIO COMPUTER COMPARISON
AU2003231176A1 (en) * 2002-04-29 2003-11-17 Laboratory For Computational Analytics And Semiotics, Llc Sequence miner
CA2384185A1 (en) * 2002-04-29 2003-10-29 Ibm Canada Limited-Ibm Canada Limitee Resizable cache sensitive hash table
US20040030572A1 (en) * 2002-05-03 2004-02-12 Helen Campbell Same day product and document delivery management system and process
US6993534B2 (en) * 2002-05-08 2006-01-31 International Business Machines Corporation Data store for knowledge-based data mining system
US7152242B2 (en) 2002-09-11 2006-12-19 Enterasys Networks, Inc. Modular system for detecting, filtering and providing notice about attack events associated with network security
US7085682B1 (en) 2002-09-18 2006-08-01 Doubleclick Inc. System and method for analyzing website activity
US20040057536A1 (en) * 2002-09-20 2004-03-25 Adc Dsl Systems, Inc. Digital correlator for multiple sequence detection
US7043476B2 (en) 2002-10-11 2006-05-09 International Business Machines Corporation Method and apparatus for data mining to discover associations and covariances associated with data
US7231403B1 (en) 2002-11-15 2007-06-12 Messageone, Inc. System and method for transformation and analysis of messaging data
US7184777B2 (en) * 2002-11-27 2007-02-27 Cognio, Inc. Server and multiple sensor system for monitoring activity in a shared radio frequency band
US7376969B1 (en) 2002-12-02 2008-05-20 Arcsight, Inc. Real time monitoring and analysis of events from multiple network security devices
US7127456B1 (en) 2002-12-05 2006-10-24 Ncr Corp. System and method for logging database queries
US7616666B1 (en) 2002-12-09 2009-11-10 Sprint Communications Company L.P. Method and system for customizing update-string processing in network elements
US6951541B2 (en) * 2002-12-20 2005-10-04 Koninklijke Philips Electronics, N.V. Medical imaging device with digital audio capture capability
US7454761B1 (en) 2002-12-20 2008-11-18 Cisco Technology, Inc. Method and apparatus for correlating output of distributed processes
AU2002359925B2 (en) 2002-12-26 2008-10-23 Fujitsu Limited Operation managing method and operation managing server
US7401141B2 (en) 2003-01-07 2008-07-15 International Business Machines Corporation Method and system for monitoring performance of distributed applications
US7548848B1 (en) 2003-01-08 2009-06-16 Xambala, Inc. Method and apparatus for semantic processing engine
US7369950B2 (en) 2003-02-07 2008-05-06 Power Measurement Ltd. System and method for power quality analytics
US7336890B2 (en) * 2003-02-19 2008-02-26 Microsoft Corporation Automatic detection and segmentation of music videos in an audio/video stream
US20040169688A1 (en) 2003-02-27 2004-09-02 Microsoft Corporation Multi-directional display and navigation of hierarchical data and optimization of display area consumption
US7885963B2 (en) * 2003-03-24 2011-02-08 Microsoft Corporation Free text and attribute searching of electronic program guide (EPG) data
CA2523283A1 (en) 2003-04-24 2004-11-11 Secureinfo Corporation Automated electronic software distribution and management method and system
EP2996054B1 (en) 2003-05-19 2020-07-08 Huawei Technologies Co., Ltd. Limiting scans of loosely ordered and/or grouped relations using nearly ordered maps
US7493499B1 (en) * 2003-05-22 2009-02-17 Copyright Clearance Center, Inc. Method and apparatus for secure delivery and rights management of digital content
US7174345B2 (en) * 2003-05-30 2007-02-06 Oracle International Corporation Methods and systems for auto-partitioning of schema objects
US20040254919A1 (en) 2003-06-13 2004-12-16 Microsoft Corporation Log parser
US7568025B2 (en) 2003-06-27 2009-07-28 Bank Of America Corporation System and method to monitor performance of different domains associated with a computer system or network
US7343604B2 (en) 2003-07-25 2008-03-11 International Business Machines Corporation Methods and apparatus for creation of parsing rules
US7546553B2 (en) 2003-07-28 2009-06-09 Sap Ag Grid landscape component
US7069176B2 (en) * 2003-08-07 2006-06-27 Arm Limited Trace source correlation in a data processing apparatus
WO2005015366A2 (en) 2003-08-08 2005-02-17 Electric Power Group, Llc Real-time performance monitoring and management system
US7406399B2 (en) 2003-08-26 2008-07-29 Siemens Energy & Automation, Inc. System and method for distributed reporting of machine performance
US7689394B2 (en) 2003-08-26 2010-03-30 Siemens Industry, Inc. System and method for remotely analyzing machine performance
US20060069717A1 (en) 2003-08-27 2006-03-30 Ascential Software Corporation Security service for a services oriented architecture in a data integration platform
US7158991B2 (en) * 2003-09-30 2007-01-02 Veritas Operating Corporation System and method for maintaining temporal data in data storage
US7089250B2 (en) 2003-10-08 2006-08-08 International Business Machines Corporation Method and system for associating events
US7379999B1 (en) 2003-10-15 2008-05-27 Microsoft Corporation On-line service/application monitoring and reporting system
US7457872B2 (en) * 2003-10-15 2008-11-25 Microsoft Corporation On-line service/application monitoring and reporting system
US8495099B2 (en) * 2003-10-24 2013-07-23 Enrico Maim Method of manipulating information objects and of accessing such objects in a computer environment
US20050091658A1 (en) * 2003-10-24 2005-04-28 Microsoft Corporation Operating system resource protection
US7376752B1 (en) 2003-10-28 2008-05-20 David Chudnovsky Method to resolve an incorrectly entered uniform resource locator (URL)
US20050114707A1 (en) * 2003-11-26 2005-05-26 Destefano Jason Michael Method for processing log data from local and remote log-producing devices
EP1536572A1 (en) 2003-11-26 2005-06-01 ADS Enterprises NZ Ltd. Power line communication system
US20050114331A1 (en) 2003-11-26 2005-05-26 International Business Machines Corporation Near-neighbor search in pattern distance spaces
US9401838B2 (en) 2003-12-03 2016-07-26 Emc Corporation Network event capture and retention system
JP4158717B2 (en) * 2004-02-16 2008-10-01 ソニー株式会社 Data recording / reproducing apparatus, data recording / reproducing method, program, and recording medium
US7321891B1 (en) * 2004-02-19 2008-01-22 Ncr Corp. Processing database queries
EP1574970A1 (en) * 2004-03-09 2005-09-14 Exalead Computer program for accessing information records of different applications
US8244725B2 (en) 2004-03-10 2012-08-14 Iron Mountain Incorporated Method and apparatus for improved relevance of search results
US20050223027A1 (en) 2004-03-31 2005-10-06 Lawrence Stephen R Methods and systems for structuring event data in a database for location and retrieval
US8346777B1 (en) 2004-03-31 2013-01-01 Google Inc. Systems and methods for selectively storing event data
US7664734B2 (en) * 2004-03-31 2010-02-16 Google Inc. Systems and methods for generating multiple implicit search queries
US8375444B2 (en) * 2006-04-20 2013-02-12 Fireeye, Inc. Dynamic signature creation and enforcement
US7447666B2 (en) * 2004-04-09 2008-11-04 The Boeing Company System and method for analyzing a pattern in a time-stamped event sequence
EP1739968A4 (en) 2004-04-15 2009-01-07 Panasonic Corp Content generation device and method
US20060004909A1 (en) 2004-04-30 2006-01-05 Shinya Takuwa Server system and a server arrangement method
US20050256956A1 (en) 2004-05-14 2005-11-17 Battelle Memorial Institute Analyzing user-activity data using a heuristic-based approach
US7363577B2 (en) 2004-05-18 2008-04-22 Novell, Inc. Techniques for serializing events
US7870200B2 (en) * 2004-05-29 2011-01-11 Ironport Systems, Inc. Monitoring the flow of messages received at a server
US7434058B2 (en) 2004-06-07 2008-10-07 Reconnex Corporation Generating signatures over a document
US7577959B2 (en) 2004-06-24 2009-08-18 International Business Machines Corporation Providing on-demand capabilities using virtual machines and clustering processes
US7301603B2 (en) 2004-06-24 2007-11-27 Taiwan Semiconductor Manufacturing Co., Ltd. Exposure system and method
US7596571B2 (en) 2004-06-30 2009-09-29 Technorati, Inc. Ecosystem method of aggregation and search and related techniques
US20060008516A1 (en) * 2004-07-07 2006-01-12 Sebastiano Scarampi Gilsonite derived pharmaceutical delivery compositions and methods: cosmetic applications
US7962489B1 (en) * 2004-07-08 2011-06-14 Sage-N Research, Inc. Indexing using contiguous, non-overlapping ranges
US7747641B2 (en) 2004-07-09 2010-06-29 Microsoft Corporation Modeling sequence and time series data in predictive analytics
US7856441B1 (en) 2005-01-10 2010-12-21 Yahoo! Inc. Search systems and methods using enhanced contextual queries
US7487072B2 (en) * 2004-08-04 2009-02-03 International Business Machines Corporation Method and system for querying multimedia data where adjusting the conversion of the current portion of the multimedia data signal based on the comparing at least one set of confidence values to the threshold
GB2434670B (en) 2004-08-13 2008-06-11 Remasys Pty Ltd Monitoring and management of distributed information systems
US7559053B2 (en) 2004-08-24 2009-07-07 Microsoft Corporation Program and system performance data correlation
US7653742B1 (en) 2004-09-28 2010-01-26 Entrust, Inc. Defining and detecting network application business activities
US7403866B2 (en) 2004-10-06 2008-07-22 Telefonaktiebolaget L M Ericsson (Publ) High-resolution, timer-efficient sliding window
US7406462B2 (en) 2004-10-19 2008-07-29 International Business Machines Corporation Prediction of query difficulty for a generic search engine
US8677274B2 (en) * 2004-11-10 2014-03-18 Apple Inc. Highlighting items for search results
US7493304B2 (en) 2004-11-12 2009-02-17 International Business Machines Corporation Adjusting an amount of data logged for a query based on a change to an access plan
US20060161816A1 (en) 2004-12-22 2006-07-20 Gula Ronald J System and method for managing events
US7809131B1 (en) 2004-12-23 2010-10-05 Arcsight, Inc. Adjusting sensor time in a network security system
US7471646B2 (en) 2005-01-10 2008-12-30 Microsoft Corporation System and methods for inline property editing in tree view based editors
US7783679B2 (en) 2005-01-12 2010-08-24 Computer Associates Think, Inc. Efficient processing of time series data
KR100717166B1 (en) * 2005-02-16 2007-05-11 삼성전자주식회사 Service framework for A Home network
US8023937B2 (en) 2005-03-04 2011-09-20 Qualcomm Incorporated Apparatus and methods for determining voice and/or data processing performance of a wireless device
US20060197766A1 (en) * 2005-03-07 2006-09-07 Digital Fuel Technologies, Inc. System for interpretation of streaming data filters
US7369954B2 (en) 2005-03-17 2008-05-06 International Business Machines Corporation Event tracing with time stamp compression and history buffer based compression
US7853585B2 (en) 2005-03-17 2010-12-14 International Business Machines Corporation Monitoring performance of a data processing system
US7779340B2 (en) 2005-03-17 2010-08-17 Jds Uniphase Corporation Interpolated timestamps in high-speed data capture and analysis
US7908314B2 (en) 2005-03-23 2011-03-15 Hitachi, Ltd. Method for controlling a management computer
US7218974B2 (en) 2005-03-29 2007-05-15 Zarpac, Inc. Industrial process data acquisition and analysis
US7810155B1 (en) 2005-03-30 2010-10-05 Symantec Corporation Performance enhancement for signature based pattern matching
US20060224583A1 (en) 2005-03-31 2006-10-05 Google, Inc. Systems and methods for analyzing a user's web history
US7290077B2 (en) 2005-04-07 2007-10-30 International Business Machines Corporation Event queue structure and method
US20060259519A1 (en) 2005-05-12 2006-11-16 Microsoft Corporation Iterative definition of flat file data structure by using document instance
US7617314B1 (en) * 2005-05-20 2009-11-10 Network General Technology HyperLock technique for high-speed network data monitoring
US9792351B2 (en) 2005-06-10 2017-10-17 International Business Machines Corporation Tolerant and extensible discovery of relationships in data using structural information and data analysis
WO2007002820A2 (en) 2005-06-28 2007-01-04 Yahoo! Inc. Search engine with augmented relevance ranking by community participation
US8266117B2 (en) * 2005-07-11 2012-09-11 Fujitsu Limited Method and apparatus for tracing data in audit trail, and computer product
US7926099B1 (en) 2005-07-15 2011-04-12 Novell, Inc. Computer-implemented method and system for security event transport using a message bus
US7818313B1 (en) * 2005-07-18 2010-10-19 Sybase, Inc. Method for distributing processing of queries over a cluster of servers in a continuous processing system
US20070033632A1 (en) * 2005-07-19 2007-02-08 March Networks Corporation Temporal data previewing system
US7937344B2 (en) 2005-07-25 2011-05-03 Splunk Inc. Machine data web
ATE504872T1 (en) * 2005-07-26 2011-04-15 Macdonald Dettwiler & Associates Inc GUIDANCE, NAVIGATION AND CONTROL SYSTEM FOR A VEHICLE
US20070043562A1 (en) * 2005-07-29 2007-02-22 David Holsinger Email capture system for a voice recognition speech application
US7529739B2 (en) * 2005-08-19 2009-05-05 Google Inc. Temporal ranking scheme for desktop searching
JP4381361B2 (en) * 2005-08-31 2009-12-09 株式会社東芝 Time-series data analysis device, time-series data analysis method, and time-series data analysis program
US8209242B1 (en) * 2005-09-02 2012-06-26 Flowvision, Llc Inventory management system
US8671031B2 (en) * 2005-09-02 2014-03-11 Flowvision, Llc Inventory management system
US7991758B2 (en) 2005-09-07 2011-08-02 International Business Machines Corporation System and method for performing a search operation within a sequential access data storage subsystem
US20070067323A1 (en) 2005-09-20 2007-03-22 Kirstan Vandersluis Fast file shredder system and method
US7512610B1 (en) 2005-09-30 2009-03-31 Miosoft Corporation Processing data records
US8196150B2 (en) 2005-10-07 2012-06-05 Oracle International Corporation Event locality using queue services
JP4581962B2 (en) * 2005-10-27 2010-11-17 株式会社日立製作所 Information retrieval system, index management method and program
WO2007059469A2 (en) 2005-11-10 2007-05-24 Computer Associates Think, Inc. System and method for delivering results of a search query in an information management system
US20110314148A1 (en) 2005-11-12 2011-12-22 LogRhythm Inc. Log collection, structuring and processing
US20070112754A1 (en) 2005-11-15 2007-05-17 Honeywell International Inc. Method and apparatus for identifying data of interest in a database
US7970742B2 (en) * 2005-12-01 2011-06-28 Oracle International Corporation Techniques for implementing indexes on columns in database tables whose values specify periods of time
US9135304B2 (en) * 2005-12-02 2015-09-15 Salesforce.Com, Inc. Methods and systems for optimizing text searches over structured data in a multi-tenant environment
US20070156786A1 (en) * 2005-12-22 2007-07-05 International Business Machines Corporation Method and apparatus for managing event logs for processes in a digital data processing system
US8392400B1 (en) * 2005-12-29 2013-03-05 Amazon Technologies, Inc. Method and apparatus for stress management in a searchable data service
US7689600B2 (en) 2005-12-30 2010-03-30 Sap Ag System and method for cluster file system synchronization
US7395187B2 (en) 2006-02-06 2008-07-01 International Business Machines Corporation System and method for recording behavior history for abnormality detection
US8386469B2 (en) 2006-02-16 2013-02-26 Mobile Content Networks, Inc. Method and system for determining relevant sources, querying and merging results from multiple content sources
US7979439B1 (en) * 2006-03-14 2011-07-12 Amazon Technologies, Inc. Method and system for collecting and analyzing time-series data
US8438170B2 (en) 2006-03-29 2013-05-07 Yahoo! Inc. Behavioral targeting system that generates user profiles for target objectives
US8589949B2 (en) 2006-05-01 2013-11-19 International Business Machines Corporation Processing multiple heterogeneous event types in a complex event processing engine
US7979245B1 (en) 2006-05-17 2011-07-12 Quest Software, Inc. Model-based systems and methods for monitoring computing resource performance
US7676288B2 (en) 2006-06-23 2010-03-09 Invensys Systems, Inc. Presenting continuous timestamped time-series data values for observed supervisory control and manufacturing/production parameters
US8046749B1 (en) 2006-06-27 2011-10-25 The Mathworks, Inc. Analysis of a sequence of data in object-oriented environments
US7580938B1 (en) 2006-06-29 2009-08-25 Emc Corporation Methods and apparatus facilitating efficient display of data
US8904299B1 (en) * 2006-07-17 2014-12-02 The Mathworks, Inc. Graphical user interface for analysis of a sequence of data in object-oriented environment
US7580944B2 (en) 2006-07-27 2009-08-25 Yahoo! Inc. Business intelligent architecture system and method
US8051162B2 (en) 2006-07-28 2011-11-01 Hewlett-Packard Development Company, L.P. Data assurance in server consolidation
US7970934B1 (en) * 2006-07-31 2011-06-28 Google Inc. Detecting events of interest
JP4788533B2 (en) 2006-09-06 2011-10-05 ヤマハ株式会社 Key drive control system
US7917240B2 (en) 2006-09-29 2011-03-29 Fisher-Rosemount Systems, Inc. Univariate method for monitoring and analysis of multivariate data
EP3493074A1 (en) 2006-10-05 2019-06-05 Splunk Inc. Time series search engine
JP5218060B2 (en) * 2006-10-06 2013-06-26 日本電気株式会社 Information retrieval system, information retrieval method and program
US8005992B1 (en) * 2006-10-13 2011-08-23 Cadence Design Systems, Inc. Scalable storage and retrieval of multiple asynchronous signals
US8909622B1 (en) 2006-10-21 2014-12-09 Sprint Communications Company L.P. Time-based log and alarm integration search tool for trouble-shooting
US7689610B2 (en) 2006-12-01 2010-03-30 Computer Associates Think, Inc. Automated grouping of messages provided to an application using string similarity analysis
US20080148280A1 (en) 2006-12-13 2008-06-19 Stillwell Joseph W Apparatus, system, and method for autonomically managing multiple queues
US7925678B2 (en) 2007-01-12 2011-04-12 Loglogic, Inc. Customized reporting and mining of event data
US8321448B2 (en) * 2007-02-22 2012-11-27 Microsoft Corporation Click-through log mining
US8949266B2 (en) 2007-03-07 2015-02-03 Vlingo Corporation Multiple web-based content category searching in mobile search application
US8793704B2 (en) 2007-03-09 2014-07-29 Microsoft Corporation Techniques to manage event notifications
US7680916B2 (en) 2007-04-24 2010-03-16 Hyperformix, Inc. System for improving the performance of a computer software application in a server network
US8200527B1 (en) 2007-04-25 2012-06-12 Convergys Cmg Utah, Inc. Method for prioritizing and presenting recommendations regarding organizaion's customer care capabilities
US7974728B2 (en) 2007-05-04 2011-07-05 Taiwan Semiconductor Manufacturing Company, Ltd. System for extraction of key process parameters from fault detection classification to enable wafer prediction
US20100179953A1 (en) * 2007-06-05 2010-07-15 Masaki Kan Information presentation system, information presentation method, and program for information presentation
US7797309B2 (en) 2007-06-07 2010-09-14 Datamaxx Applied Technologies, Inc. System and method for search parameter data entry and result access in a law enforcement multiple domain security environment
US20080319975A1 (en) * 2007-06-22 2008-12-25 Microsoft Corporation Exploratory Search Technique
US8073806B2 (en) 2007-06-22 2011-12-06 Avaya Inc. Message log analysis for system behavior evaluation
US20090003219A1 (en) 2007-06-29 2009-01-01 General Instrument Corporation Monitoring Performance Metrics Associated With a Wireless Network
US7783750B1 (en) 2007-07-30 2010-08-24 Hewlett-Packard Development Company, L.P. System and method for externalized real-time log correlation and performance monitoring of service-oriented applications
AU2008286192B2 (en) 2007-08-06 2014-04-10 Mls Technologies Pty Ltd Method and/or system for searching network content
US7979362B2 (en) 2007-08-10 2011-07-12 Motorola Solutions, Inc. Interactive data mining system
US8645390B1 (en) 2007-08-31 2014-02-04 Google Inc. Reordering search query results in accordance with search context specific predicted performance functions
US8046353B2 (en) 2007-11-02 2011-10-25 Citrix Online Llc Method and apparatus for searching a hierarchical database and an unstructured database with a single search query
US20090138435A1 (en) * 2007-11-26 2009-05-28 Leslie Mannion Techniques for searching and presenting search results
EP2248003A1 (en) 2007-12-31 2010-11-10 Netapp, Inc. System and method for automatic storage load balancing in virtual server environments
US20090177692A1 (en) 2008-01-04 2009-07-09 Byran Christopher Chagoly Dynamic correlation of service oriented architecture resource relationship and metrics to isolate problem sources
JP5034955B2 (en) 2008-01-08 2012-09-26 富士通株式会社 Performance evaluation simulation apparatus, performance evaluation simulation method, and performance evaluation simulation program
JP4995104B2 (en) 2008-01-16 2012-08-08 株式会社日立製作所 Performance monitoring condition setting / management method and computer system using the method
US8380692B2 (en) * 2008-01-25 2013-02-19 Nuance Communications, Inc. Fast index with supplemental store
US8175863B1 (en) 2008-02-13 2012-05-08 Quest Software, Inc. Systems and methods for analyzing performance of virtual environments
US8013859B2 (en) 2008-03-20 2011-09-06 Vmware, Inc. Graphical display for illustrating effectiveness of resource management and resource balancing
US8031634B1 (en) 2008-03-31 2011-10-04 Emc Corporation System and method for managing a virtual domain environment to enable root cause and impact analysis
US8326796B2 (en) 2008-04-15 2012-12-04 International Business Machines Corporation Data retrieval and data feed generation from data sources
US7797415B2 (en) 2008-04-29 2010-09-14 Computer Associates Think, Inc. Automatic context-based baselining for transactions
US9053159B2 (en) 2009-02-09 2015-06-09 The Boeing Company Non-conformance analysis using an associative memory learning agent
US8214842B2 (en) 2009-02-27 2012-07-03 International Business Machines Corporation Visualization-centric performance-based volume allocation
US8271615B2 (en) 2009-03-31 2012-09-18 Cloud Connex, Llc Centrally managing and monitoring software as a service (SaaS) applications
US8271633B2 (en) 2009-04-16 2012-09-18 Exfo Service Assurance Inc. Correlating network transactions
JP4951034B2 (en) 2009-06-25 2012-06-13 株式会社日立製作所 Computer system and its operation information management method
US8880524B2 (en) 2009-07-17 2014-11-04 Apple Inc. Scalable real time event stream processing
US9092411B2 (en) 2009-08-18 2015-07-28 Miosoft Corporation Understanding data in data sets
US9037555B2 (en) 2009-11-12 2015-05-19 Bmc Software, Inc. Asynchronous collection and correlation of trace and communications event data
US8707194B1 (en) 2009-11-20 2014-04-22 Amazon Technologies, Inc. System and method for decentralized performance monitoring of host systems
US8245140B2 (en) 2009-12-31 2012-08-14 International Business Machines Corporation Visualization and consolidation of virtual machines in a virtualized data center
US20110179160A1 (en) 2010-01-21 2011-07-21 Microsoft Corporation Activity Graph for Parallel Programs in Distributed System Environment
US8924376B1 (en) * 2010-01-31 2014-12-30 Bryant Christopher Lee Method for human ranking of search results
US8229953B2 (en) 2010-03-25 2012-07-24 Oracle International Corporation Metric correlation and analysis
JP5423553B2 (en) 2010-04-09 2014-02-19 株式会社日立製作所 Database management method, computer, sensor network system, and database search program
US8813039B2 (en) * 2010-04-14 2014-08-19 International Business Machines Corporation Method and system for software defect reporting
US9124488B2 (en) 2010-04-21 2015-09-01 Vmware, Inc. Method and apparatus for visualizing the health of datacenter objects
US20110298804A1 (en) 2010-06-07 2011-12-08 Hao Ming C Visual display of data from a plurality of data sources
US9846628B2 (en) 2010-06-15 2017-12-19 Microsoft Technology Licensing, Llc Indicating parallel operations with user-visible events
US8522167B2 (en) 2010-08-09 2013-08-27 Microsoft Corporation Relationship visualization and graphical interaction model in it client management
US20120078925A1 (en) 2010-09-27 2012-03-29 International Business Machines Corporation Searching within log files
US8914601B1 (en) * 2010-10-18 2014-12-16 Reservoir Labs, Inc. Systems and methods for a fast interconnect table
US9378111B2 (en) 2010-11-11 2016-06-28 Sap Se Method and system for easy correlation between monitored metrics and alerts
US8842119B2 (en) 2010-11-17 2014-09-23 Hewlett-Packard Development Company, L.P. Displaying system performance information
US20120130774A1 (en) 2010-11-18 2012-05-24 Dror Daniel Ziv Analyzing performance using video analytics
US9152726B2 (en) * 2010-12-01 2015-10-06 Microsoft Technology Licensing, Llc Real-time personalized recommendation of location-related entities
JP5678620B2 (en) * 2010-12-03 2015-03-04 株式会社日立製作所 Data processing method, data processing system, and data processing apparatus
JP5871193B2 (en) 2010-12-24 2016-03-01 日本電気株式会社 Monitoring data analysis apparatus, monitoring data analysis method, and monitoring data analysis program
US8667496B2 (en) 2011-01-04 2014-03-04 Host Dynamics Ltd. Methods and systems of managing resources allocated to guest virtual machines
US8412696B2 (en) 2011-01-31 2013-04-02 Splunk Inc. Real time searching and reporting
US8589375B2 (en) 2011-01-31 2013-11-19 Splunk Inc. Real time searching and reporting
US9990215B2 (en) 2011-02-22 2018-06-05 Vmware, Inc. User interface for managing a virtualized computing environment
US9053221B2 (en) 2011-03-04 2015-06-09 International Business Machines Corporation Promotion of performance parameters in distributed data processing environment
US20120239681A1 (en) 2011-03-14 2012-09-20 Splunk Inc. Scalable interactive display of distributed data
US8615773B2 (en) 2011-03-31 2013-12-24 Honeywell International Inc. Systems and methods for coordinating computing functions to accomplish a task using a configuration file and standardized executable application modules
US8782192B2 (en) 2011-05-31 2014-07-15 Red Hat, Inc. Detecting resource consumption events over sliding intervals in cloud-based network
US9495270B2 (en) 2011-05-31 2016-11-15 Vmware, Inc. User interface for monitoring the status of an object in a virtual computing environment
US20120317266A1 (en) 2011-06-07 2012-12-13 Research In Motion Limited Application Ratings Based On Performance Metrics
US20120323941A1 (en) 2011-06-17 2012-12-20 Microsoft Corporation Processing Queries for Event Data in a Foreign Representation
US8812555B2 (en) * 2011-06-18 2014-08-19 Microsoft Corporation Dynamic lock-free hash tables
US8589543B2 (en) 2011-07-01 2013-11-19 Cisco Technology, Inc. Virtual data center monitoring
EP2557503B1 (en) 2011-07-28 2020-04-01 Tata Consultancy Services Ltd. Application performance measurement and reporting
WO2013023030A2 (en) 2011-08-10 2013-02-14 Opnet Technologies, Inc. Application performance analysis that is adaptive to business activity patterns
US8510807B1 (en) 2011-08-16 2013-08-13 Edgecast Networks, Inc. Real-time granular statistical reporting for distributed platforms
US9927958B2 (en) 2011-08-25 2018-03-27 Vmware, Inc. User interface for networks including virtual machines
US8850512B2 (en) 2011-10-13 2014-09-30 Mcafee, Inc. Security assessment of virtual machine environments
US10810218B2 (en) 2011-10-14 2020-10-20 Transunion, Llc System and method for matching of database records based on similarities to search queries
US8793118B2 (en) 2011-11-01 2014-07-29 PES School of Engineering Adaptive multimodal communication assist system
US20130124714A1 (en) 2011-11-11 2013-05-16 Vmware, Inc. Visualization of combined performance metrics
US20130239124A1 (en) 2012-01-20 2013-09-12 Mentor Graphics Corporation Event Queue Management For Embedded Systems
US9020976B2 (en) 2012-01-31 2015-04-28 Bank Of America Corporation System and method for processing data in diverse storage systems
US9172608B2 (en) 2012-02-07 2015-10-27 Cloudera, Inc. Centralized configuration and monitoring of a distributed computing cluster
EP2645257A3 (en) 2012-03-29 2014-06-18 Prelert Ltd. System and method for visualisation of behaviour within computer infrastructure
US20130262656A1 (en) 2012-03-30 2013-10-03 Jin Cao System and method for root cause analysis of mobile network performance problems
US9501849B2 (en) 2012-05-11 2016-11-22 Vmware, Inc. Multi-dimensional visualization tool for browsing and troubleshooting at scale
US10387448B2 (en) 2012-05-15 2019-08-20 Splunk Inc. Replication of summary data in a clustered computing environment
US8516008B1 (en) * 2012-05-18 2013-08-20 Splunk Inc. Flexible schema column store
US8682925B1 (en) 2013-01-31 2014-03-25 Splunk Inc. Distributed high performance analytics store
EP2856348B1 (en) 2012-06-04 2019-10-23 EntIT Software LLC User-defined loading of data onto a database
US9766962B2 (en) 2012-06-07 2017-09-19 Vmware, Inc. Correlating performance degradation of applications to specific changes made to applications
US10387201B2 (en) 2012-06-26 2019-08-20 Vmware, Inc. Storage performance-based virtual machine placement
US8620928B1 (en) 2012-07-16 2013-12-31 International Business Machines Corporation Automatically generating a log parser given a sample log
US9183529B2 (en) 2012-08-01 2015-11-10 Oracle International Corporation Business intelligence performance analysis system
US10741285B2 (en) 2012-08-16 2020-08-11 Ginger.io, Inc. Method and system for providing automated conversations
US8825664B2 (en) * 2012-08-17 2014-09-02 Splunk Inc. Indexing preview
US9753909B2 (en) 2012-09-07 2017-09-05 Splunk, Inc. Advanced field extractor with multiple positive examples
US9594814B2 (en) 2012-09-07 2017-03-14 Splunk Inc. Advanced field extractor with modification of an extracted field
US8788525B2 (en) 2012-09-07 2014-07-22 Splunk Inc. Data model for machine data for semantic search
US9582585B2 (en) 2012-09-07 2017-02-28 Splunk Inc. Discovering fields to filter data returned in response to a search
US9384056B2 (en) 2012-09-11 2016-07-05 Red Hat Israel, Ltd. Virtual resource allocation and resource and consumption management
US9047352B1 (en) 2012-09-28 2015-06-02 Emc Corporation Centralized searching in a data storage environment
US9634904B2 (en) 2012-12-13 2017-04-25 Level 3 Communications, Llc Framework supporting content delivery with hybrid content delivery services
US8874740B2 (en) 2013-01-04 2014-10-28 Fmr Llc Customer experience monitor
US9467464B2 (en) 2013-03-15 2016-10-11 Tenable Network Security, Inc. System and method for correlating log data to discover network vulnerabilities and assets
US9154397B2 (en) 2013-03-15 2015-10-06 Microsoft Technology Licensing, Llc Methods and computer program products for transaction relationships between application servers
US8904389B2 (en) 2013-04-30 2014-12-02 Splunk Inc. Determining performance states of components in a virtual machine environment based on performance states of related subcomponents
US20140324862A1 (en) 2013-04-30 2014-10-30 Splunk Inc. Correlation for user-selected time ranges of values for performance metrics of components in an information-technology environment with log data from that information-technology environment
US10019496B2 (en) 2013-04-30 2018-07-10 Splunk Inc. Processing of performance data and log data from an information technology environment by using diverse data stores
US10318541B2 (en) 2013-04-30 2019-06-11 Splunk Inc. Correlating log data with performance measurements having a specified relationship to a threshold value
US9495187B2 (en) 2013-04-30 2016-11-15 Splunk, Inc. Interactive, top-down presentation of the architecture and performance of a hypervisor environment
US10997191B2 (en) 2013-04-30 2021-05-04 Splunk Inc. Query-triggered processing of performance data and log data from an information technology environment
US8683467B2 (en) 2013-04-30 2014-03-25 Splunk Inc. Determining performance states of parent components in a virtual-machine environment based on performance states of related child components
US10614132B2 (en) 2013-04-30 2020-04-07 Splunk Inc. GUI-triggered processing of performance data and log data from an information technology environment
US10225136B2 (en) 2013-04-30 2019-03-05 Splunk Inc. Processing of log data and performance data obtained via an application programming interface (API)
US10346357B2 (en) 2013-04-30 2019-07-09 Splunk Inc. Processing of performance data and structure data from an information technology environment
US10353957B2 (en) 2013-04-30 2019-07-16 Splunk Inc. Processing of performance data and raw log data from an information technology environment
US9164786B2 (en) 2013-04-30 2015-10-20 Splunk Inc. Determining performance states of parent components in a virtual-machine environment based on performance states of related child components during a time period
US20150379065A1 (en) 2013-05-10 2015-12-31 Hitachi, Ltd. Database server storing plurality of versions of data, and database management method
US9116901B2 (en) 2013-05-23 2015-08-25 Oracle International Corporation Database snapshot analysis
US9215240B2 (en) 2013-07-25 2015-12-15 Splunk Inc. Investigative and dynamic detection of potential security-threat indicators from events in big data
US9268805B2 (en) * 2013-09-10 2016-02-23 International Business Machines Corporation Managing reusable artifacts using placeholders
US9582527B2 (en) * 2013-10-28 2017-02-28 Pivotal Software, Inc. Compacting data file histories
WO2015065435A1 (en) 2013-10-31 2015-05-07 Hewlett-Packard Development Company, L.P. Storing time series data for a search query
US20150143180A1 (en) 2013-11-21 2015-05-21 Microsoft Corporation Validating software characteristics
US9361329B2 (en) 2013-12-13 2016-06-07 International Business Machines Corporation Managing time series databases
US9437022B2 (en) 2014-01-27 2016-09-06 Splunk Inc. Time-based visualization of the number of events having various values for a field
US9838346B2 (en) 2014-03-17 2017-12-05 Splunk Inc. Alerting on dual-queue systems
DE102014104711A1 (en) * 2014-04-02 2015-10-08 RWTH Aachen - Körperschaft des öffentlichen Rechts Method for producing a cohesive joint connection and structural element
US10693742B2 (en) 2014-04-15 2020-06-23 Splunk Inc. Inline visualizations of metrics related to captured network data
US10523521B2 (en) 2014-04-15 2019-12-31 Splunk Inc. Managing ephemeral event streams generated from captured network data
US10360196B2 (en) 2014-04-15 2019-07-23 Splunk Inc. Grouping and managing event streams generated from captured network data
US10700950B2 (en) 2014-04-15 2020-06-30 Splunk Inc. Adjusting network data storage based on event stream statistics
US9251221B1 (en) 2014-07-21 2016-02-02 Splunk Inc. Assigning scores to objects based on search query results
US9047246B1 (en) 2014-07-31 2015-06-02 Splunk Inc. High availability scheduler
US10133806B2 (en) 2014-07-31 2018-11-20 Splunk Inc. Search result replication in a search head cluster
US9767197B1 (en) 2014-08-20 2017-09-19 Vmware, Inc. Datacenter operations using search and analytics
WO2016032489A1 (en) 2014-08-28 2016-03-03 Landmark Graphics Corporation Optimizing multistage hydraulic fracturing design based on three-dimensional (3d) continuum damage mechanics
US10127258B2 (en) 2014-09-30 2018-11-13 Splunk Inc. Event time selection output techniques
US9922099B2 (en) 2014-09-30 2018-03-20 Splunk Inc. Event limited field picker
US9210056B1 (en) 2014-10-09 2015-12-08 Splunk Inc. Service monitoring interface
US9146962B1 (en) 2014-10-09 2015-09-29 Splunk, Inc. Identifying events using informational fields
US9158811B1 (en) 2014-10-09 2015-10-13 Splunk, Inc. Incident review interface
US9245057B1 (en) 2014-10-09 2016-01-26 Splunk Inc. Presenting a graphical visualization along a time-based graph lane using key performance indicators derived from machine data
US9130832B1 (en) 2014-10-09 2015-09-08 Splunk, Inc. Creating entity definition from a file
US9146954B1 (en) 2014-10-09 2015-09-29 Splunk, Inc. Creating entity definition from a search result set
US20160125314A1 (en) * 2014-10-30 2016-05-05 Yahoo! Inc. Systems and methods for native advertisement selection and formatting
WO2016070111A1 (en) 2014-10-31 2016-05-06 Bedrock Data, Inc. Cross-platform data synchronization
US9922037B2 (en) 2015-01-30 2018-03-20 Splunk Inc. Index time, delimiter based extractions and previewing for use in indexing
US10200824B2 (en) * 2015-05-27 2019-02-05 Apple Inc. Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device
US9384203B1 (en) * 2015-06-09 2016-07-05 Palantir Technologies Inc. Systems and methods for indexing and aggregating data records
US10255136B2 (en) 2015-09-21 2019-04-09 International Business Machines Corporation Data backup management during workload migration
DE102015219866A1 (en) 2015-10-13 2017-04-13 Eos Gmbh Electro Optical Systems Device and method for producing a three-dimensional object
US11604795B2 (en) * 2016-09-26 2023-03-14 Splunk Inc. Distributing partial results from an external data system between worker nodes
US10545964B2 (en) 2017-01-30 2020-01-28 Splunk Inc. Multi-phased data execution in a data processing system
US11061918B2 (en) 2017-04-05 2021-07-13 Splunk Inc. Locating and categorizing data using inverted indexes
US10776361B2 (en) 2017-04-07 2020-09-15 Salesforce.Com, Inc. Time series database search system
US10417044B2 (en) 2017-04-21 2019-09-17 International Business Machines Corporation System interventions based on expected impacts of system events on scheduled work units
US10346315B2 (en) * 2017-05-26 2019-07-09 Oracle International Corporation Latchless, non-blocking dynamically resizable segmented hash index
US10789232B2 (en) 2017-08-25 2020-09-29 Vmware, Inc. Method and system for generating a query plan for time series data
US10855793B2 (en) 2017-09-25 2020-12-01 Splunk Inc. Proxying hypertext transfer protocol (HTTP) requests for microservices
US11308088B2 (en) * 2017-12-28 2022-04-19 Aveva Software, Llc Associating comments with tags in a managed historian system
US10776441B1 (en) 2018-10-01 2020-09-15 Splunk Inc. Visual programming for iterative publish-subscribe message processing system
US11238048B1 (en) 2019-07-16 2022-02-01 Splunk Inc. Guided creation interface for streaming data processing pipelines
US11663176B2 (en) 2020-07-31 2023-05-30 Splunk Inc. Data field extraction model training for a data intake and query system
US20220036177A1 (en) 2020-07-31 2022-02-03 Splunk Inc. Data field extraction by a data intake and query system

Patent Citations (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5745693A (en) * 1992-07-01 1998-04-28 Mci Corporation System for gathering and reporting real time data from an IDNX communications network
US5347540A (en) * 1992-07-08 1994-09-13 Tektronix, Inc. Dynamic storage allocation in a logic analyzer
US5847972A (en) * 1993-09-24 1998-12-08 Eick; Stephen Gregory Method and apparatus for graphically analzying a log-file
US5613113A (en) * 1993-10-08 1997-03-18 International Business Machines Corporation Consistent recreation of events from activity logs
US5737600A (en) * 1994-09-12 1998-04-07 International Business Machines Corporation Method and system for log management in a coupled data processing system
US6115705A (en) * 1997-05-19 2000-09-05 Microsoft Corporation Relational database system and method for query processing using early aggregation
US6920468B1 (en) * 1998-07-08 2005-07-19 Ncr Corporation Event occurrence detection method and apparatus
US6449618B1 (en) * 1999-03-25 2002-09-10 Lucent Technologies Inc. Real-time event processing system with subscription model
US6980963B1 (en) * 1999-11-05 2005-12-27 Ford Motor Company Online system and method of status inquiry and tracking related to orders for consumer product having specific configurations
US20040143602A1 (en) * 2002-10-18 2004-07-22 Antonio Ruiz Apparatus, system and method for automated and adaptive digital image/video surveillance for events and configurations using a rich multimedia relational database
US20050015624A1 (en) * 2003-06-09 2005-01-20 Andrew Ginter Event monitoring and management
US20050033803A1 (en) * 2003-07-02 2005-02-10 Vleet Taylor N. Van Server architecture and methods for persistently storing and serving event data
US20070073743A1 (en) * 2004-02-13 2007-03-29 Memento Inc. Systems and methods for monitoring and detecting fraudulent uses of business applications
US20060026164A1 (en) * 2004-03-31 2006-02-02 Jung Edward K Data storage for distributed sensor networks
US7827182B1 (en) * 2004-06-02 2010-11-02 Cisco Technology, Inc Searching for a path to identify where to move entries among hash tables with storage for multiple entries per bucket during insert operations
US7673340B1 (en) * 2004-06-02 2010-03-02 Clickfox Llc System and method for analyzing system user behavior
US20060184529A1 (en) * 2005-02-16 2006-08-17 Gal Berg System and method for analysis and management of logs and events
US20060218278A1 (en) * 2005-03-24 2006-09-28 Fujitsu Limited Demand forecasting system for data center, demand forecasting method and recording medium with a demand forecasting program recorded thereon
US20080279113A1 (en) * 2005-03-24 2008-11-13 Janne Kalliola Information Gathering From Traffic Flow in a Communication Network
US20060229931A1 (en) * 2005-04-07 2006-10-12 Ariel Fligler Device, system, and method of data monitoring, collection and analysis
US7627544B2 (en) * 2005-05-20 2009-12-01 Microsoft Corporation Recognizing event patterns from event streams
US20060265406A1 (en) * 2005-05-20 2006-11-23 Microsoft Corporation Recognizing event patterns from event streams
US20070073519A1 (en) * 2005-05-31 2007-03-29 Long Kurt J System and Method of Fraud and Misuse Detection Using Event Logs
US20070038889A1 (en) * 2005-08-11 2007-02-15 Wiggins Robert D Methods and systems to access process control log information associated with process control systems
US20090172014A1 (en) * 2005-08-23 2009-07-02 Raymond John Huetter Stream-Oriented Database Machine and Method
US20070067575A1 (en) * 2005-09-20 2007-03-22 Morris John M Method of managing cache memory based on data temperature
US20070283194A1 (en) * 2005-11-12 2007-12-06 Phillip Villella Log collection, structuring and processing
US20070113031A1 (en) * 2005-11-16 2007-05-17 International Business Machines Corporation Memory management system and method for storing and retrieving messages
US20070124437A1 (en) * 2005-11-30 2007-05-31 Cisco Technology, Inc. Method and system for real-time collection of log data from distributed network components
US7620697B1 (en) * 2006-03-10 2009-11-17 Omniture, Inc. Online syndicated content feed metrics
US8601112B1 (en) * 2006-03-14 2013-12-03 Amazon Technologies, Inc. Method and system for collecting and analyzing time-series data

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11249971B2 (en) 2006-10-05 2022-02-15 Splunk Inc. Segmenting machine data using token-based signatures
US11144526B2 (en) 2006-10-05 2021-10-12 Splunk Inc. Applying time-based search phrases across event data
US9928262B2 (en) 2006-10-05 2018-03-27 Splunk Inc. Log data time stamp extraction and search on log data real-time monitoring environment
US11550772B2 (en) 2006-10-05 2023-01-10 Splunk Inc. Time series search phrase processing
US11537585B2 (en) 2006-10-05 2022-12-27 Splunk Inc. Determining time stamps in machine data derived events
US11526482B2 (en) 2006-10-05 2022-12-13 Splunk Inc. Determining timestamps to be associated with events in machine data
US9747316B2 (en) 2006-10-05 2017-08-29 Splunk Inc. Search based on a relationship between log data and data from a real-time monitoring environment
US9922067B2 (en) 2006-10-05 2018-03-20 Splunk Inc. Storing log data as events and performing a search on the log data and data obtained from a real-time monitoring environment
US10891281B2 (en) 2006-10-05 2021-01-12 Splunk Inc. Storing events derived from log data and performing a search on the events and data that is not log data
US10977233B2 (en) 2006-10-05 2021-04-13 Splunk Inc. Aggregating search results from a plurality of searches executed across time series data
US10740313B2 (en) 2006-10-05 2020-08-11 Splunk Inc. Storing events associated with a time stamp extracted from log data and performing a search on the events and data that is not log data
US10747742B2 (en) 2006-10-05 2020-08-18 Splunk Inc. Storing log data and performing a search on the log data and data that is not log data
US11947513B2 (en) 2006-10-05 2024-04-02 Splunk Inc. Search phrase processing
US11561952B2 (en) 2006-10-05 2023-01-24 Splunk Inc. Storing events derived from log data and performing a search on the events and data that is not log data
US10346357B2 (en) 2013-04-30 2019-07-09 Splunk Inc. Processing of performance data and structure data from an information technology environment
US11250068B2 (en) 2013-04-30 2022-02-15 Splunk Inc. Processing of performance data and raw log data from an information technology environment using search criterion input via a graphical user interface
US10614132B2 (en) 2013-04-30 2020-04-07 Splunk Inc. GUI-triggered processing of performance data and log data from an information technology environment
US10997191B2 (en) 2013-04-30 2021-05-04 Splunk Inc. Query-triggered processing of performance data and log data from an information technology environment
US10592522B2 (en) 2013-04-30 2020-03-17 Splunk Inc. Correlating performance data and log data using diverse data stores
US11119982B2 (en) 2013-04-30 2021-09-14 Splunk Inc. Correlation of performance data and structure data from an information technology environment
US10877987B2 (en) 2013-04-30 2020-12-29 Splunk Inc. Correlating log data with performance measurements using a threshold value
US10353957B2 (en) 2013-04-30 2019-07-16 Splunk Inc. Processing of performance data and raw log data from an information technology environment
US10019496B2 (en) 2013-04-30 2018-07-10 Splunk Inc. Processing of performance data and log data from an information technology environment by using diverse data stores
US10225136B2 (en) 2013-04-30 2019-03-05 Splunk Inc. Processing of log data and performance data obtained via an application programming interface (API)
US10318541B2 (en) 2013-04-30 2019-06-11 Splunk Inc. Correlating log data with performance measurements having a specified relationship to a threshold value
US10877986B2 (en) 2013-04-30 2020-12-29 Splunk Inc. Obtaining performance data via an application programming interface (API) for correlation with log data
US11782989B1 (en) 2013-04-30 2023-10-10 Splunk Inc. Correlating data based on user-specified search criteria
US10922298B2 (en) 2019-01-30 2021-02-16 Walmart Apollo, Llc System and method for indexing time-series-based data

Also Published As

Publication number Publication date
US10891281B2 (en) 2021-01-12
US9996571B2 (en) 2018-06-12
US20170344591A1 (en) 2017-11-30
US20210103575A1 (en) 2021-04-08
EP2074505A4 (en) 2010-01-13
US20180225319A1 (en) 2018-08-09
US9594789B2 (en) 2017-03-14
CN101641674B (en) 2012-10-10
US20170139963A1 (en) 2017-05-18
CN102831214B (en) 2017-05-10
US20160140128A1 (en) 2016-05-19
US11947513B2 (en) 2024-04-02
US20170139968A1 (en) 2017-05-18
US20230205749A1 (en) 2023-06-29
US11550772B2 (en) 2023-01-10
US8112425B2 (en) 2012-02-07
WO2008043082A2 (en) 2008-04-10
US20080215546A1 (en) 2008-09-04
EP2074505A2 (en) 2009-07-01
CN102831214A (en) 2012-12-19
US10977233B2 (en) 2021-04-13
US20200174986A1 (en) 2020-06-04
US10740313B2 (en) 2020-08-11
US11526482B2 (en) 2022-12-13
US20130060783A1 (en) 2013-03-07
US9928262B2 (en) 2018-03-27
KR20090080065A (en) 2009-07-23
US20210248122A1 (en) 2021-08-12
US10242039B2 (en) 2019-03-26
US20120117079A1 (en) 2012-05-10
US20170337230A1 (en) 2017-11-23
US10678767B2 (en) 2020-06-09
US20150149480A1 (en) 2015-05-28
US20170046403A1 (en) 2017-02-16
US9922067B2 (en) 2018-03-20
KR101380936B1 (en) 2014-04-10
US20180246919A1 (en) 2018-08-30
KR20120059625A (en) 2012-06-08
WO2008043082A3 (en) 2009-04-09
US9922066B2 (en) 2018-03-20
US9514175B2 (en) 2016-12-06
US20180157693A1 (en) 2018-06-07
US10216779B2 (en) 2019-02-26
JP2010506311A (en) 2010-02-25
US20160070736A1 (en) 2016-03-10
US20160154836A1 (en) 2016-06-02
US10262018B2 (en) 2019-04-16
US11144526B2 (en) 2021-10-12
EP2482209A1 (en) 2012-08-01
US10255312B2 (en) 2019-04-09
US20170139961A1 (en) 2017-05-18
US20160140238A1 (en) 2016-05-19
US20170337231A1 (en) 2017-11-23
US20220156244A1 (en) 2022-05-19
US20190171629A1 (en) 2019-06-06
US11561952B2 (en) 2023-01-24
US11249971B2 (en) 2022-02-15
US9922065B2 (en) 2018-03-20
US10747742B2 (en) 2020-08-18
US11537585B2 (en) 2022-12-27
EP3493074A1 (en) 2019-06-05
US20150339351A1 (en) 2015-11-26
US9747316B2 (en) 2017-08-29
US8990184B2 (en) 2015-03-24
US9002854B2 (en) 2015-04-07
US20190213180A1 (en) 2019-07-11
US20210248123A1 (en) 2021-08-12
CN101641674A (en) 2010-02-03
KR101266267B1 (en) 2013-05-23
US20190171630A1 (en) 2019-06-06
JP5264740B2 (en) 2013-08-14
US20180173739A1 (en) 2018-06-21
US20170046402A1 (en) 2017-02-16
US20190163673A1 (en) 2019-05-30

Similar Documents

Publication Publication Date Title
US11947513B2 (en) Search phrase processing

Legal Events

Date Code Title Description
AS Assignment

Owner name: SPLUNK INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BAUM, MICHAEL J.;CARASSO, DAVID;DAS, ROBIN K.;AND OTHERS;SIGNING DATES FROM 20080403 TO 20080417;REEL/FRAME:043354/0960

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION