有多种转换HTML表的方法 into使用JSON,CSV或Excel电子表格 GrabzIt的PHP API,这里详细介绍了一些最有用的技术。 但是,在开始之前,请记住 URLToTable, HTMLToTable or 文件到表格 方法 Save or SaveTo 必须调用方法来捕获表。 如果您想快速查看此服务是否适合您,可以尝试 捕获HTML表的实时演示 从URL。
以下代码示例自动转换在指定网页中发现的第一个HTML表 intoa CSV文件。
$grabzIt->URLToTable("https://www.tesla.com"); //Then call the Save or SaveTo method
$grabzIt->HTMLToTable("<html><body><table><tr><th>Name</th><th>Age</th></tr> <tr><td>Tom</td><td>23</td></tr><tr><td>Nicola</td><td>26</td></tr> </table></body></html>"); //Then call the Save or SaveTo method
$grabzIt->FileToTable("tables.html"); //Then call the Save or SaveTo method
默认情况下,它将转换它标识的第一个表 intoa表。 但是,可以通过将2传递给网页中的第二个表来进行转换 setTableNumberToInclude
方法。
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setTableNumberToInclude(2); $grabzIt->URLToTable("https://www.tesla.com", $options); //Then call the Save or SaveTo method $grabzIt->SaveTo("result.csv");
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setTableNumberToInclude(2); $grabzIt->HTMLToTable("<html><body><table><tr><th>Name</th><th>Age</th></tr> <tr><td>Tom</td><td>23</td></tr><tr><td>Nicola</td><td>26</td></tr> </table></body></html>", $options); //Then call the Save or SaveTo method $grabzIt->SaveTo("result.csv");
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setTableNumberToInclude(2); $grabzIt->FileToTable("tables.html", $options); //Then call the Save or SaveTo method $grabzIt->SaveTo("result.csv");
您也可以使用 setTargetElement
确保只转换指定元素ID中的表的方法。
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setTargetElement("stocks_table"); $grabzIt->URLToTable("https://www.tesla.com", $options); //Then call the Save or SaveTo method $grabzIt->SaveTo("result.csv");
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setTargetElement("stocks_table"); $grabzIt->HTMLToTable("<html><body><table id='stocks_table'><tr><th>Name</th><th>Age</th></tr> <tr><td>Tom</td><td>23</td></tr><tr><td>Nicola</td><td>26</td></tr> </table></body></html>", $options); //Then call the Save or SaveTo method $grabzIt->SaveTo("result.csv");
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setTargetElement("stocks_table"); $grabzIt->FileToTable("tables.html", $options); //Then call the Save or SaveTo method $grabzIt->SaveTo("result.csv");
另外,您也可以通过将true传递给 setIncludeAllTables
方法,但是仅适用于XLSX和JSON格式。 此选项会将每个表放在生成的电子表格工作簿中的新表中。
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setFormat('xlsx'); $options->setIncludeAllTables(true); $grabzIt->URLToTable("https://www.tesla.com", $options); //Then call the Save or SaveTo method $grabzIt->SaveTo("result.xlsx");
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setFormat('xlsx'); $options->setIncludeAllTables(true); $grabzIt->HTMLToTable("<html><body><table><tr><th>Name</th><th>Age</th></tr> <tr><td>Tom</td><td>23</td></tr><tr><td>Nicola</td><td>26</td></tr> </table></body></html>", $options); //Then call the Save or SaveTo method $grabzIt->SaveTo("result.xlsx");
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setFormat('xlsx'); $options->setIncludeAllTables(true); $grabzIt->FileToTable("tables.html", $options); //Then call the Save or SaveTo method $grabzIt->SaveTo("result.xlsx");
有时需要以编程方式读取HTML表格,GrabzIt使您可以通过转换在线HTML表格来使用PHP来执行此操作 into JSON。 为此,请指定 json
作为格式参数。 例如,在下面的示例中,我们正在转换HTML表 同步地 然后使用内置 json_decode
PHP解析JSON的方法 string into我们可以轻松使用的对象。
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setFormat("json"); $options->setTableNumberToInclude(1); $grabzIt->URLToTable("https://www.tesla.com", $options); $json = $grabzIt->SaveTo(); if ($json != null) { $tableObj = json_decode($json); }
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setFormat("json"); $options->setTableNumberToInclude(1); $grabzIt->HTMLToTable("<html><body><table><tr><th>Name</th><th>Age</th></tr> <tr><td>Tom</td><td>23</td></tr><tr><td>Nicola</td><td>26</td></tr> </table></body></html>", $options); $json = $grabzIt->SaveTo(); if ($json != null) { $tableObj = json_decode($json); }
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setFormat("json"); $options->setTableNumberToInclude(1); $grabzIt->FileToTable("tables.html", $options); $json = $grabzIt->SaveTo(); if ($json != null) { $tableObj = json_decode($json); }
您可以将自定义标识符传递给 表 方法,如下所示,然后将该值返回到您的GrabzIt PHP处理程序。 例如,此自定义标识符可以是数据库标识符,从而允许将提取的表与特定的数据库记录相关联。
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setCustomId(123456); $grabzIt->URLToTable("https://www.tesla.com", $options); //Then call the Save method $grabzIt->Save("http://www.example.com/handler.php");
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setCustomId(123456); $grabzIt->HTMLToTable("<html><body><h1>Hello World!</h1></body></html>", $options); //Then call the Save method $grabzIt->Save("http://www.example.com/handler.php");
$grabzIt = new \GrabzIt\GrabzItClient("Sign in to view your Application Key", "Sign in to view your Application Secret"); $options = new \GrabzIt\GrabzItTableOptions(); $options->setCustomId(123456); $grabzIt->FileToTable("example.html", $options); //Then call the Save method $grabzIt->Save("http://www.example.com/handler.php");