--- title: 'Extract' description: 'Extract structured data from the page' icon: 'brain-circuit' --- `extract()` grabs structured text from the current page using [zod](https://github.com/colinhacks/zod). Given instructions and `schema`, you will receive structured data. We strongly suggest you set `useTextExtract` to `true` if you are extracting data from a longer body of text. ### `extract` a single object Here is how an `extract` call might look for a single object: ```javascript const item = await page.extract({ instruction: "extract the price of the item", schema: z.object({ price: z.number(), }), }); ``` Your output schema will look like: ``` { price: number } ``` ### `extract` a list of objects Here is how an `extract` call might look for a list of objects. Note that you need to wrap the `z.array` in an outer `z.object`. ```javascript const apartments = await stagehand.page.extract({ instruction: "Extract ALL the apartment listings and their details, including address, price, and square feet." schema: z.object({ list_of_apartments: z.array( z.object({ address: z.string(), price: z.string(), square_feet: z.string(), }), ), }} }) console.log("the apartment data is: ", apartments)); ``` Your output schema will look like: ``` list_of_apartments: [ { address: "street address here", price: "$1234.00", square_feet: "700" }, { address: "another address here", price: "1010.00", square_feet: "500" }, . . . ] ``` To provide some additional context at the field level within your schema, you can use `.describe()`. See the snippet below: ```javascript const apartments = await stagehand.page.extract({ instruction: "Extract ALL the apartment listings and their details, including address, price, and square feet." schema: z.object({ list_of_apartments: z.array( z.object({ address: z.string().describe("the address of the apartment"), price: z.string().describe("the price of the apartment"), square_feet: z.string().describe("the square footage of the apartment"), }), ), }} }) ``` ### **Arguments:** [`ExtractOptions`](https://github.com/browserbase/stagehand/blob/main/types/stagehand.ts) Provides instructions for extraction Defines the structure of the data to extract This method converts the page to text, which is much cleaner for LLMs than the DOM. However, it may not work for use cases that involve DOM metadata elements. An xpath that can be used to reduce the scope of an extraction. If an xpath is passed in, `extract` will only process the contents of the HTML element that the xpath points to. Useful for reducing input tokens and increasing extraction accuracy. Only works when `useTextExtract: true`. Specifies the model to use Configuration options for the model client. See [`ClientOptions`](https://github.com/browserbase/stagehand/blob/main/types/model.ts). Timeout in milliseconds for waiting for the DOM to settle ### **Returns:** [`Promise>`](https://github.com/browserbase/stagehand/blob/main/types/stagehand.ts) Resolves to the structured data as defined by the provided `schema`.